A GEOMETRIC TREATMENT OF IMPLICIT DIFFERENTIAL-ALGEBRAIC EQUATIONS¹

BY 

Patrick  J.  Rabier  and  Werner  C.  Rheinboldt 
Department  of  Mathematics  and  Statistics 
University  of  Pittsburgh 
Pittsburgh,  PA  15260 

Abstract.  A  differential-geometric  approach  for  proving  the  existence  and  uniqueness  of 
solutions  of  implicit  differential-algebraic  equations  is  presented.  It  provides  for  a  significant 
improvement  of  an  earlier  theory  developed  by  the  authors  as  well  as  for  a  completely  intrinsic 
definition  of  the  index  of  such  problems.  The  differential-algebraic  equation  is  transformed 
into  an  explicit  ordinary  differential  equation  by  a  reduction  process  that  can  be  abstractly 
defined for specific submanifolds of tangent bundles here called reducible π-submanifolds.
Local  existence  and  uniqueness  results  for  differential-algebraic  equations  then  follow  directly 
from  the  final  stage  of  this  reduction  by  means  of  an  application  of  the  standard  theory  of 
ordinary  differential  equations. 


1.  Introduction. 

An  implicit  differential  equation 

(1.1)  $F(t, x, \dot x) = 0, \qquad F : \mathbb{R} \times \mathbb{R}^n \times \mathbb{R}^n \to \mathbb{R}^n,$

with a sufficiently smooth mapping $F$ is usually referred to as a differential-algebraic
equation (DAE) when the partial derivative $D_pF(t, x, p)$ has constant rank $\rho < n$ on the
(open) domain of $F$ or, more generally, when the constant rank condition holds in some
open neighborhood of $F^{-1}(0)$ in $\mathbb{R} \times \mathbb{R}^n \times \mathbb{R}^n$.

¹ This work was supported in part by ONR-grant N-00014-90-J-1025, NSF-grant CCR-8907654, and
AFOSR-grant 90-0094.

In [11] we presented a general existence and uniqueness theory for such DAEs which
confirms  the  commonly  accepted  idea  that,  in  some  way,  any  reasonable  DAE  can  be 
reduced  -  at  least  locally  -  to  an  explicit  ODE.  Broadly  speaking,  a  sequence  of  equations 

(1.2)  $F_j(t, x, \dot x) = 0, \qquad F_0 = F, \qquad F_j : \mathbb{R} \times \mathbb{R}^n \times \mathbb{R}^n \to \mathbb{R}^n, \qquad j = 0, 1, \dots,$

is constructed provided each equation is a DAE; that is, $D_pF_j$ has constant rank. The
sequence terminates with $j = \nu$ when $D_pF_\nu$ has full rank. Then the equation (1.2) with
$j = \nu$ is reducible to an explicit ODE and $\nu$ is called the index of the problem. The
solutions of (1.1) automatically satisfy all equations (1.2) of the sequence and, conversely,
every solution of a particular equation (1.2) with consistent initial data solves (1.1). Here
the consistency condition reflects the well-known fact that, in contrast with ODEs, a DAE
(1.1) does not have locally defined solutions for arbitrary initial data in $F^{-1}(0)$. The
reduction process fails when one of the equations (1.2) is a singular differential equation,
for which the existence theory, as yet quite incomplete, is substantially different from that
for ODEs or DAEs (see [10]).

The  concept  of  an  index  plays  an  important  role  in  the  study  of  DAEs.  For  linear  DAEs 

(1.3)  $F(t, x, p) = Ax + Bp - f(t), \qquad f : \mathbb{R} \to \mathbb{R}^n,$

with constant $n \times n$ matrices $A$ and $B$ such that $\operatorname{rank} B = \rho < n$, the index has been
defined (see [3], [7]) as the index of the matrix pencil $(A, B)$ in the sense of [5]. For
linear systems (1.3) the theory in [11] requires the same hypotheses and leads to the
same value of the index as the classical theory. For the general case (1.1) various authors
(see, e.g., the monographs [2], [9], and [6], [8] for references) have introduced definitions
which generalize the linear index. These definitions serve well for classifying DAEs and
for characterizing properties of numerical procedures, but, by themselves, they do not
incorporate any existence results for solutions of the equations.

Although the reduction process in [11] provides a general setting for a solvability theory
of DAEs, it turns out not to be intrinsic since the sequence $\{F_j\}$ is not uniquely determined
by the mapping $F$. In fact, for the construction of the $F_j$, various equally reasonable
approaches are feasible and it is by no means clear whether the resulting solvability criteria are
equivalent and, in particular, whether the index of the problem is independent of $\{F_j\}$. A
principal aim of this paper is the development of a completely intrinsic geometric approach
to the existence and uniqueness of solutions of DAEs that extends the analytic treatment of [11].

By adding, as usual, the equation $\dot t = 1$ we can transform (1.1) into an autonomous
problem. Thus without loss of generality we consider here equations of the form

(1.4)  $F(x, \dot x) = 0, \qquad F : \mathbb{R}^n \times \mathbb{R}^n \to \mathbb{R}^n.$
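
As a small illustration of this standard device, one possible way to write the augmented autonomous system is sketched below. The symbols $y$ and $\widetilde{F}$ are introduced here only for illustration and are not part of the original notation.

```latex
y = \begin{pmatrix} t \\ x \end{pmatrix} \in \mathbb{R}^{n+1}, \qquad
\widetilde{F}(y, \dot y) = \begin{pmatrix} \dot t - 1 \\ F(t, x, \dot x) \end{pmatrix} = 0, \qquad
\widetilde{F} : \mathbb{R}^{n+1} \times \mathbb{R}^{n+1} \to \mathbb{R}^{n+1}.
```

In this form the partial derivative of $\widetilde{F}$ with respect to $\dot y$ is block diagonal with blocks $1$ and $D_pF$, so its rank exceeds that of $D_pF$ by one and a constant rank condition on $D_pF$ carries over to the augmented problem.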

Then, formally, and without any of the technicalities involved, our approach is based on
the following general idea: As in [11] suppose that the zero set $M = F^{-1}(0)$ is a smooth
submanifold of $T\mathbb{R}^n = \mathbb{R}^n \times \mathbb{R}^n$. In practice, this is guaranteed by assuming that $F$ is a
submersion on its zero set. Now (1.4) may be written in the form

(1.5)  $(x, \dot x) \in M$

which in turn implies that any solution $x = x(t)$ of (1.4) has to satisfy $x(t) \in W = \pi(M)$,
where $\pi : T\mathbb{R}^n \to \mathbb{R}^n$ is the canonical projection onto the first factor. If $W$ is a submanifold
of $\mathbb{R}^n$ then $(x(t), \dot x(t))$ belongs to the tangent bundle $TW$ of $W$, here identified with a
submanifold of $T\mathbb{R}^n$. In other words, any solution $x = x(t)$ of (1.4) has to satisfy not
only (1.5) but also the more restricted inclusion $(x(t), \dot x(t)) \in M_1 = TW \cap M$. The step
from (1.5) to this new restricted problem represents a reduction of (1.4) similar to that
involved in the construction of the equation (1.2) with $j = 1$. Hence it is appropriate
to call $M_1$ the first reduction of $M$. If $M_1$ and $W_1 = \pi(M_1)$ are submanifolds of $T\mathbb{R}^n$
and $\mathbb{R}^n$, respectively, then the reduction can be continued a step further and the same
argument yields $(x(t), \dot x(t)) \in M_2 = TW_1 \cap M_1$. Hence, under suitable conditions, we
obtain a decreasing sequence $M_0 = M, M_1, M_2, \dots$ of manifolds and find that for every
solution $x = x(t)$ of (1.4), $(x(t), \dot x(t))$ has to belong to the 'core' $\bigcap_j M_j$ of $M$.

It is reasonable to expect the decreasing sequence of manifolds $M_j$ to become stationary.
The first $\nu$ such that $M_\nu = M_{\nu+1}$ is then called the index of (1.4). This index definition
was proposed by S. Reich (see [12], [13]) without further results about the existence of the
manifolds or their properties. Clearly the practical value of the approach depends largely
upon satisfactory answers to the following two questions:

1)  Are there realistic conditions for ensuring that $M_{j+1}$ and $W_{j+1}$ are submanifolds of
$M_j$ and $W_j$ or, equivalently, of $T\mathbb{R}^n$ and $\mathbb{R}^n$, respectively?

2)  Does  the  reduction  lead  to  a  conclusion  about  the  existence  of  solutions  of  (1.4)? 

There are significant technical and conceptual difficulties in providing positive answers
to both these questions. As a typical example consider the problem of deciding whether
$M_{j+1} = TW_j \cap M_j$ is a submanifold, which may appear to be resolvable by a standard
transversality argument. But one soon finds that dimensional considerations dictated by
the second question rule out at once transversality of $M_j$ and $TW_j$ in the natural ambient
manifold $TW_{j-1}$. In fact, prior to anything else, it turns out that the global approach in
our expository sketch must be replaced by a local one. This also makes the 'subimmersion
theorem' available, which turns out to play a critical role here.

After some preliminaries in Section 2, a framework for the local approach is established
in Section 3. Then, in Section 4 a reducibility concept is introduced and geometric conditions
for reducibility are given which then provide a positive answer to the first question
(or, rather, its analog in the local setting). Next, in Section 5, we settle (the analog of) the
second question by means of a concept of complete reducibility. The entire development
makes no reference to DAEs although occasionally we use a simple DAE to motivate some
of the general definitions. However, Section 6 does present the application of the general
theory to DAEs. In particular, the connection to Reich's index and an existence theory
for (1.4) is obtained as an immediate corollary to the result that, locally, the analog of the
above indicated 'core' of $M = F^{-1}(0)$ is the image of a section of a suitable tangent bundle,
and therefore that, on this core, the DAE (1.4) reduces locally to an explicit ODE. For
practical applications and, in particular, for numerical purposes it is desirable to express
the various geometric conditions in analytic terms. This is the topic of Section 7 and the
results given there allow, in Section 8, for a comparison between the geometric theory and
the results of [11].

Altogether, it appears that the new geometric treatment offers three important advantages.
First of all, unlike in [11], constant rank conditions are required only where they are
unquestionably needed; that is, on appropriate submanifolds and not on open subsets of
the ambient space. For instance, it becomes legitimate to call (1.4) a differential-algebraic
equation if the constant rank condition for $D_pF$ is satisfied only at points of $M = F^{-1}(0)$.
This was suspected to be true but could not be proved by the methods of [11]. Next, as
discussed in Section 8, a rather complicated condition in [11] involving second derivatives
can now be replaced by a much simpler one involving only first derivatives. In fact, it was
not noticed in [11], and could hardly have been expected there, that the second derivatives
play an entirely passive role. Finally, the geometric treatment achieves its original goal of
supplying an intrinsic definition for the index of DAEs which directly incorporates a local
existence theory for the equations. It appears to be fair to say that as long as singularities
are ruled out, the theory provides a nearly optimal answer to the existence and uniqueness
question for differential-algebraic equations.

2.  Preliminaries. 

Throughout  this  presentation  we  consider  only  finite  dimensional,  separable,  Hausdorff 
manifolds which, for simplicity of exposition, are assumed to be of class $C^\infty$. But it should
be  evident  that  finite  regularity  will  suffice  in  general.  As  usual,  it  is  explicitly  allowed 
that  different  connected  components  of  a  manifold  M  may  have  different  dimensions  and 
we  denote  the  maximal  dimension  of  the  connected  components  of  M  by  dim  M.  When 
all  connected  components  of  M  have  the  same  dimension  we  follow  [4]  and  say  that  M  has 
pure  dimension  dim  M. 

For any n-dimensional manifold $X$ we denote the tangent bundle by $TX$ and the canonical
projection by $\pi : TX \to X$. Points of $TX$ will be written in the form $(x, p)$ with $x \in X$
and $p \in T_xX$. In particular, for simplicity, we always write $T\mathbb{R}^n$ for $\mathbb{R}^n \times \mathbb{R}^n$. For any
submanifold $Y$ of $X$ and any point $x \in Y$ the tangent space $T_xY$ is canonically identified
with a subspace of $T_xX$ and hence the tangent bundle $TY$ is identified with a submanifold
of $TX$. Note that this is not a sub-bundle since the base manifolds $X$ and $Y$ of $TX$ and
$TY$, respectively, are different.


As in the case of manifolds we assume for simplicity that all mappings under consideration
are of class $C^\infty$ although, once again, this condition can easily be relaxed. For
convenience, the notation $f : \mathbb{R}^k \to \mathbb{R}^m$ will be used for any mapping $f$ defined on some
open subset $U$ of $\mathbb{R}^k$ with values in $\mathbb{R}^m$, even if $U \ne \mathbb{R}^k$. This slight abuse of notation
should not lead to any confusion.

Recall that a mapping $f : X \to Y$ between the manifolds $X$ and $Y$ is a subimmersion
if the rank of the linear map $T_xf : T_xX \to T_{f(x)}Y$ is constant in some open neighborhood
of every point $x \in X$. Then the rank $r$ of $T_xf$ has a constant value on each connected
component $E$ of $X$, called the rank of $f$ on $E$. For simplicity we call $f : X \to Y$ a local
subimmersion at $x \in X$ if there exists an open neighborhood $U$ of $x$ in $X$ such that the
restriction of $f$ to the submanifold $U$ of $X$ is a subimmersion on $U$.

The  following  subimmersion  theorem  plays  a  key  role  in  several  of  our  arguments;  for  a 
proof  see,  for  instance,  [1],  [4]: 

Theorem 2.1. (subimmersion theorem): Let $X$ be a connected manifold of dimension
$m$ and suppose that the mapping $f : X \to Y$ from $X$ into another manifold $Y$ is a
subimmersion of rank $r$. Then, for any $x \in X$ and $y = f(x) \in Y$, the following statements
hold:

(i)  The subset $f^{-1}(y)$ is a closed, $(m - r)$-dimensional submanifold of $X$ and the tangent
space $T_x(f^{-1}(y))$ coincides with $\ker T_xf$.

(ii)  There exists an open neighborhood $V$ of $x$ in $X$ such that $f(V)$ is an $r$-dimensional
submanifold of $Y$. Moreover, if $N$ is any $r$-dimensional submanifold of $X$ such that $x \in N$
and $T_xN \cap \ker T_xf = \{0\}$ then the restriction $f|_N$ is a local diffeomorphism of some
neighborhood of $x$ in $N$ onto $f(V)$.

For the application to DAEs the following characterization theorem for certain subimmersions
will be needed:

Theorem 2.2. For the mapping $G : T\mathbb{R}^m \to \mathbb{R}^q$, $m \le q < 2m$, suppose that $DG(x, p)$ has
full rank $q$ at a point $(x_0, p_0) \in G^{-1}(0)$ and hence that for some open neighborhood $U$ of
this point in $T\mathbb{R}^m$ the set $M = U \cap G^{-1}(0)$ is a $(2m - q)$-dimensional submanifold of $T\mathbb{R}^m$.
Then the restriction $\pi|_M : M \to \mathbb{R}^m$ of the canonical projection is a local subimmersion
at $(x_0, p_0)$ if and only if $\operatorname{rank} D_pG(x, p) = \rho < m$ is constant in a neighborhood of $(x_0, p_0)$
in $M$. In that case, locally near $(x_0, p_0)$, the rank of $\pi|_M$ equals $m + \rho - q$.

Proof. For $(x, p) \in M$ locally near $(x_0, p_0)$ we have

(2.1)  $T_{(x,p)}M = \ker DG(x, p) = \{(h, k) \in T\mathbb{R}^m : D_xG(x, p)h + D_pG(x, p)k = 0\}.$

Since $\pi : T\mathbb{R}^m \to \mathbb{R}^m$ is linear, the differential of $\pi|_M$ at $(x, p)$ is simply the restriction of
$\pi$ to $T_{(x,p)}M$; that is, the mapping

(2.2)  $(h, k) \in T_{(x,p)}M \mapsto h \in \mathbb{R}^m.$

Because $\dim T_{(x,p)}M = \dim \ker DG(x, p) = 2m - q$ is constant in a neighborhood of
$(x_0, p_0)$, the mappings (2.2) will have constant rank exactly if their null-spaces are of
constant dimension locally near $(x_0, p_0)$. From (2.1) we see that $(0, k) \in T_{(x,p)}M$ if and
only if $D_pG(x, p)k = 0$; that is, exactly if $k \in \ker D_pG(x, p)$, and $\ker D_pG(x, p)$ has
constant dimension for $(x, p) \in M$ near $(x_0, p_0)$ if and only if $D_pG(x, p)$ has constant rank
on $M$ locally near $(x_0, p_0)$. Moreover, if this rank is equal to $\rho$ then $\ker D_pG(x, p)$ has
dimension $m - \rho$ and hence the mapping (2.2), and equivalently $\pi|_M$, has locally near
$(x_0, p_0)$ the rank $2m - q - (m - \rho) = m + \rho - q$. □

As  noted  already,  the  development  in  the  next  four  sections  will  be  independent  of 
DAEs.  But  the  following  simple  DAE  shall  occasionally  be  used  to  motivate  some  of  the 
concepts: 


(2.3)  $F(x, \dot x) = \begin{pmatrix} x_1 - \cos x_2 \\ \dot x_1 - x_3 \\ \dot x_2 - 1 \end{pmatrix} = 0, \qquad F : T\mathbb{R}^3 \to \mathbb{R}^3.$

Obviously, $DF(x, p)$ has rank 3 everywhere and hence $M = F^{-1}(0)$ is a 3-dimensional
submanifold of $T\mathbb{R}^3$. If $x = x(t)$ is any $C^1$-solution of (2.3) then by differentiating the
algebraic equation $x_1(t) - \cos x_2(t) = 0$ and using the differential equations we see that
this solution must also satisfy the equation $x_3(t) + \sin x_2(t) = 0$. In other words, any such
solution must be contained in the set

(2.4)  $Y = g^{-1}(0) \subset \mathbb{R}^3, \qquad g(x) = (x_1 - \cos x_2,\ x_3 + \sin x_2)^T, \quad x \in \mathbb{R}^3,$

which, because of $\operatorname{rank} Dg(x) = 2$, $x \in \mathbb{R}^3$, is a one-dimensional submanifold of $\mathbb{R}^3$. Thus,
if $x = x(t)$ is a solution of (2.3), then $(x(t), \dot x(t)) \in TY \cap F^{-1}(0)$ and it is obvious that the
converse is true.
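
For the reader's convenience, the differentiation step and the rank of $Dg$ can be written out as follows. This is only a worked restatement of the computation described above for the example (2.3); no new assumptions are introduced.

```latex
\begin{aligned}
0 &= \frac{d}{dt}\bigl(x_1(t) - \cos x_2(t)\bigr)
   = \dot x_1(t) + \dot x_2(t)\,\sin x_2(t) \\
  &= x_3(t) + \sin x_2(t)
   \qquad \text{(using } \dot x_1 = x_3,\ \dot x_2 = 1\text{)}, \\[4pt]
Dg(x) &= \begin{pmatrix} 1 & \sin x_2 & 0 \\ 0 & \cos x_2 & 1 \end{pmatrix},
\qquad \operatorname{rank} Dg(x) = 2 \ \text{ for all } x \in \mathbb{R}^3 .
\end{aligned}
```

The rank statement follows because the first and third columns of $Dg(x)$ already form the $2 \times 2$ identity matrix.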

3.  π-submanifolds.

For the DAE (1.4) suppose, as in the Introduction, that $M = F^{-1}(0)$ is an n-dimensional
submanifold of $T\mathbb{R}^n$ and hence that (1.4) may be written in the form (1.5); that is,

(3.1)  $(x, \dot x) \in M.$

In fact, a central feature of our approach is the use of (3.1) and the study of its geometric
implications. The reduction process sketched in the Introduction replaces (3.1) by
a  sequence  of  problems  of  the  same  form  but  with  different  submanifolds  M.  Thus,  for 
the  development  of  this  process  we  need  to  characterize  the  submanifolds  M  that  will  be 
allowed  to  arise  in  (3.1). 

Clearly, for a given problem (3.1) with an arbitrarily given submanifold $M$ of $T\mathbb{R}^n$ there
may be no solution at all. For instance, in the case $n = 2$ this is certainly true for the
one-dimensional submanifold $M = \{(x, p) \in T\mathbb{R}^2 : x_2 = x_1,\ p_1 = 1,\ p_2 = x_1\}$. Moreover,
when $M$ has dimension larger than $n$ then there are usually many solutions through a
given point.

In connection with (3.1) our guiding situation will be the choice $M = \varphi(Y)$ where
$\varphi : Y \to TY$ is a section of the tangent bundle $TY$ of some connected submanifold $Y$ of
$\mathbb{R}^n$. Then the resulting problem (3.1) is locally equivalent to an explicit ODE and hence
is locally solvable. This occurred for the example (2.3) where on the tangent bundle of the
submanifold $Y$ of $\mathbb{R}^3$ given by (2.4) the canonical projection $\pi : TY \cap F^{-1}(0) \to Y$ has the
global inverse

$\varphi : Y \to TY, \qquad \varphi(x) = (x, p), \quad p = (x_3, 1, -\cos x_2)^T,$

whence - in this case globally - $M = TY \cap F^{-1}(0)$ coincides with $\varphi(Y)$ and the DAE (2.3)
is equivalent to the ODE $\dot x = (x_3, 1, -\cos x_2)^T$ with initial data on $Y$.
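
The equivalence with an explicit ODE can be checked numerically. The following is a minimal sketch, assuming a classical fourth-order Runge-Kutta scheme; the step size, the number of steps, and the starting value `x2_start` are hypothetical choices made only for this illustration. It integrates the ODE from a consistent initial point on $Y$ and measures how far the result drifts from the constraint set (2.4).

```python
import math

def rhs(x):
    """Right-hand side of the explicit ODE x' = (x3, 1, -cos x2)^T."""
    _, x2, x3 = x
    return (x3, 1.0, -math.cos(x2))

def g(x):
    """Constraint map from (2.4); the manifold Y is its zero set."""
    x1, x2, x3 = x
    return (x1 - math.cos(x2), x3 + math.sin(x2))

def rk4_step(x, h):
    """One classical Runge-Kutta step of size h for the autonomous system."""
    k1 = rhs(x)
    k2 = rhs(tuple(xi + 0.5 * h * ki for xi, ki in zip(x, k1)))
    k3 = rhs(tuple(xi + 0.5 * h * ki for xi, ki in zip(x, k2)))
    k4 = rhs(tuple(xi + h * ki for xi, ki in zip(x, k3)))
    return tuple(
        xi + h / 6.0 * (a + 2.0 * b + 2.0 * c + d)
        for xi, a, b, c, d in zip(x, k1, k2, k3, k4)
    )

def integrate(x0, h, steps):
    """Apply `steps` Runge-Kutta steps starting from x0."""
    x = x0
    for _ in range(steps):
        x = rk4_step(x, h)
    return x

# Hypothetical consistent initial data: any point with g(x0) = 0 lies on Y.
x2_start = 0.3
x0 = (math.cos(x2_start), x2_start, -math.sin(x2_start))

x_end = integrate(x0, h=0.01, steps=200)      # integrate up to t = 2
drift = max(abs(v) for v in g(x_end))         # distance from the constraint set
print("constraint drift at t = 2:", drift)
```

Starting instead from a point with $g(x_0) \ne 0$ still yields a solution of the ODE, but not of the DAE (2.3), which is why the initial data must be taken on $Y$.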

This guiding case suggests that, in general, we should require that $M$ is embedded in
the tangent bundle $TY$ of some submanifold $Y$ of $\mathbb{R}^n$ of the same dimension as $M$. Of
course, as noted before, this dimensional restriction will have to be formulated locally.

As indicated in the Introduction, the class of equations of the general form (1.4) also
includes the singular differential equations for which the existence theory is substantially
different from that of ODEs and DAEs. This has to be reflected in the allowable choice of
the manifolds in the problem (3.1). In the existence theory of [11] these singular equations
were essentially excluded by the assumption that $\operatorname{rank} D_pF(x, p) = \rho < n$ in some open
set in the domain of $F$. This condition certainly holds for our example (2.3). As mentioned
before, it will suffice to introduce such a constant rank condition for $D_pF$ only on
neighborhoods of points of $M$. As Theorem 2.2 shows, this is essentially equivalent to
the assumption that locally the restriction of the canonical projection $\pi$ to the manifold is
a subimmersion.

In line with these introductory remarks we now introduce the following class of submanifolds
$M$ that shall be allowed in problems of the form (3.1):

Definition 3.1. Let $X$ be an n-dimensional manifold. A submanifold $M$ of $TX$ is a
$\pi$-submanifold (of $TX$) if for each connected component $E$ of $M$ the following conditions
hold:

(i)  For any $(x, p) \in E$ there exists an open neighborhood $U$ in $E$ of $(x, p)$ and a submanifold
$Y$ of $X$ such that $\dim Y = \dim E$ and $U \subset TY$.

(ii)  The mapping $\pi|_E : E \to X$ is a subimmersion in some neighborhood of any point
$(x, p) \in E$.²

² Since $E$ is connected this is equivalent to assuming that $\pi|_E$ is a subimmersion on $E$.




If $Y$ is a connected submanifold of $X$ and $\varphi : Y \to TY$ a section of $TY$ then $M =
\varphi(Y)$ turns out to be a $\pi$-submanifold of $TX$. If $X$ has pure dimension $n$, every pure
n-dimensional submanifold of $TX$ satisfies the condition (i) of the definition with $Y = X$ but
not necessarily the condition (ii). However, in our simple example (2.3) the 3-dimensional
submanifold $M = F^{-1}(0)$ of $T\mathbb{R}^3$ is a $\pi$-submanifold. In fact, we have everywhere
$\operatorname{rank} D_pF(x, p) = 2$ which by Theorem 2.2 implies that $\pi|_M$ is a subimmersion on all of $M$.
This example shows that there exist $\pi$-submanifolds that are not the image of a section.
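
The rank count behind this claim can be reproduced in a few lines. The sketch below is only illustrative: the helper `rank` (plain Gaussian elimination with a tolerance) and the sample points are assumptions made for this example, not part of the theory. It evaluates $DF$ and $D_pF$ for the mapping (2.3) at a few points of $M$ and applies the formula $m + \rho - q$ of Theorem 2.2.

```python
import math

def rank(rows, tol=1e-10):
    """Rank of a small dense matrix via Gaussian elimination with partial pivoting."""
    a = [list(map(float, r)) for r in rows]
    n_rows = len(a)
    n_cols = len(a[0]) if a else 0
    r = 0
    for c in range(n_cols):
        if r == n_rows:
            break
        piv = max(range(r, n_rows), key=lambda i: abs(a[i][c]))
        if abs(a[piv][c]) <= tol:
            continue                      # no pivot in this column
        a[r], a[piv] = a[piv], a[r]
        for i in range(r + 1, n_rows):
            f = a[i][c] / a[r][c]
            for j in range(c, n_cols):
                a[i][j] -= f * a[r][j]
        r += 1
    return r

def jacobians(x):
    """D_x F and D_p F for F(x, p) = (x1 - cos x2, p1 - x3, p2 - 1)."""
    _, x2, _ = x
    DxF = [[1.0, math.sin(x2), 0.0],
           [0.0, 0.0, -1.0],
           [0.0, 0.0, 0.0]]
    DpF = [[0.0, 0.0, 0.0],
           [1.0, 0.0, 0.0],
           [0.0, 1.0, 0.0]]
    return DxF, DpF

def order_from_ranks(x, m=3):
    """Return (rank DF, rank D_pF, m + rho - q) at the base point x."""
    DxF, DpF = jacobians(x)
    DF = [rx + rp for rx, rp in zip(DxF, DpF)]   # full Jacobian [D_x F | D_p F]
    q = rank(DF)
    rho = rank(DpF)
    return q, rho, m + rho - q

# Hypothetical sample base points satisfying x1 = cos x2 (F does not constrain x3).
samples = [(1.0, 0.0, 0.0), (0.0, math.pi / 2, 2.0), (math.cos(1.3), 1.3, -5.0)]
results = [order_from_ranks(x) for x in samples]
print(results)
```

At each sample point the computed triple gives full rank 3 for $DF$, rank 2 for $D_pF$, and hence order $3 + 2 - 3 = 2$, which is smaller than $\dim M = 3$.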

One of our principal aims will be to prove that under certain assumptions a general
$\pi$-submanifold contains another 'maximal' $\pi$-submanifold which, at least locally, is the image
of a section $\varphi : Y \to TY$ for some submanifold $Y$ of $X$. This maximal $\pi$-submanifold will
be obtained by the recursive reduction procedure to be described below. In preparation
we introduce some needed terminology derived from Definition 3.1.

Let $E$ be any connected component of the $\pi$-submanifold $M$ of $TX$ and $(x, p) \in E$ any
given point. Then, by Definition 3.1 (ii), the rank of $\pi|_E$ is constant on some neighborhood
of $(x, p)$ and therefore on all of $E$ due to the connectedness of $E$. Moreover, by the
subimmersion theorem (Theorem 2.1) there exists an open neighborhood $V$ of $(x, p)$ in
$E$ (and therefore also in $M$) such that $W = \pi(V)$ is a submanifold of $X$ of dimension equal
to the rank $r$ of $\pi|_E$ on $E$. Obviously, the rank $r$ cannot exceed the dimension of $E$.

Definition 3.2. Let $E$ be any connected component of the $\pi$-submanifold $M$ of $TX$. For
any $(x, p) \in E$ the rank of $\pi|_E$ at $(x, p)$ is the order of that point in $M$ and is denoted
by $\operatorname{ord}_M(x, p)$. The order of all points of $E$ is the same and we write $\operatorname{ord}_M E$. If $V$ is an
open neighborhood in $E$ of $(x, p) \in E$ such that $W = \pi(V)$ is a submanifold of $X$ with
$\dim W = \operatorname{ord}_M E$ then $W$ is called a local projection of $E$ (or $M$) at $(x, p)$.

The observations preceding the definition show that local projections exist at each
point of $M$. The following remark provides a useful technical tool for later use:

Remark 3.1. Let $E$ be a connected component of a $\pi$-submanifold $M$ of $TX$ and at a
point $(x_0, p_0) \in E$ choose - in accordance with Definition 3.1 (i) - an open neighborhood
$U$ in $E$ and a submanifold $Y$ of $X$ such that $\dim Y = \dim E$ and $U \subset TY$. Then $TY$ is a
submanifold of $TX$ and therefore $U$ is a submanifold of $TY$. Now Definition 3.1 (ii) requires
that $\pi|_E : E \to X$, and therefore also $\pi|_U : U \to X$, is a subimmersion in some neighborhood
of $(x_0, p_0)$. This holds if and only if $\pi|_U : U \to Y$ is a subimmersion in a neighborhood
of $(x_0, p_0)$. In other words, it does not matter whether $U$ is viewed as a submanifold of
$TX$ or of $TY$ since for $(x, p) \in U$ near $(x_0, p_0)$ we have $T_{(x,p)}(\pi|_U) = (T_{(x,p)}\pi)|_{T_{(x,p)}U}$ and
$T_{(x,p)}U$ is contained in both $T_{(x,p)}(TX)$ and $T_{(x,p)}(TY)$. This argument also proves that
the order of $(x_0, p_0)$ equals the rank of $\pi|_U : U \to Y$.

The next theorem provides a rather simple condition for a $\pi$-submanifold to be - locally
- the image of a section.

Theorem 3.1. Let $M$ be a $\pi$-submanifold such that

(3.2)  $\dim E = \operatorname{ord}_M E$

for each connected component $E$ of $M$. Then for any $(x_0, p_0) \in M$ there exists a local
projection $W = \pi(V)$ of $M$ at $(x_0, p_0)$ and a section $\varphi : W \to TW$ such that $V = \varphi(W)$.

Proof. Let $E$ be a connected component of $M$ and $(x_0, p_0) \in E$ a given point. By Remark
3.1 and the hypothesis (3.2) it follows that $\pi|_U : U \to Y$ has full rank $m = \dim E$ at $(x_0, p_0)$
and hence that $\pi|_U$ is a diffeomorphism of some open neighborhood $V \subset U$ of $(x_0, p_0)$ onto
the open subset $W = \pi(V)$ of $Y$. Let $\varphi : W \to V$ be the inverse diffeomorphism; then it
follows that $(x, p) \in V$ if and only if $x = \pi(x, p) \in W$ and $(x, p) = \varphi(x)$.

Now note that $V$ is a submanifold of $TW$. Indeed, $W$ is open in $Y$ whence $TW =
(\pi|_{TY})^{-1}(W) = \pi^{-1}(W) \cap TY$. Moreover $V \subset TY$ and $V \subset \pi^{-1}(W) = \pi^{-1}(\pi(V))$ together
imply that $V \subset TW$. Hence, since both $V$ and $TW$ are submanifolds of $TY$, we see that
$V$ is a submanifold of $TW$. Thus, instead of viewing $\varphi$ as a mapping with values in $V$, we
may consider it to be a mapping with values in $TW$. Finally, $\varphi$ is a section of $TW$ since
$\pi \circ \varphi(x) = x$ holds for all $x \in W$ by definition of $\varphi$. □

Clearly, the assumption that $M$ is a $\pi$-submanifold is crucial to the proof. Naturally, for
a general $\pi$-submanifold $M$ the condition (3.2) may not hold for all connected components
$E$. Our reduction process will ensure that under appropriate assumptions this condition
will hold for some submanifold of $M$.

4.  Reducible π-submanifolds.

As before we formulate the implicit DAE (1.4) as a problem of the form (3.1) where
now $M$ is a given $\pi$-submanifold of $TX$. Let $x = x(t)$ be a local solution of (3.1) through a
point $(x(0), \dot x(0)) = (x_0, p_0) \in M$. Because $M$ is a $\pi$-submanifold there is a neighborhood
$V$ of $(x_0, p_0)$ in $M$ such that $W = \pi(V)$ is a local projection at that point. Hence, for all
sufficiently small $t$ we have $(x(t), \dot x(t)) \in V$ which implies that $x(t) \in W$ and, since $W$ is a
submanifold of $X$, that $(x(t), \dot x(t)) \in TW$. In particular, we must have $(x_0, p_0) \in TW \cap M$
which, evidently, represents a necessary condition for the existence of a local solution
through that point. This situation is reflected in the following terminology:

Definition 4.1. For the manifold $X$ let $M$ be any $\pi$-submanifold of $TX$. A point $(x, p) \in
M$ is a point of reducibility of $M$ if for some local projection $W = \pi(V)$ of $M$ at $(x, p)$ we
have $p \in T_xW$ and hence $(x, p) \in TW \cap M$. The subset of all points of reducibility of $M$
is the reduction of $M$ and is denoted by $M'$.

Obviously, if the condition $p \in T_xW$ holds for some local projection $W$ of $M$ at $(x, p)$
then it does so for all others as well. Hence the concept of a point of reducibility is
independent of the particular choice of the local projection $W = \pi(V)$ of $M$ at $(x, p)$.

In the following we shall be concerned only with the reduction $M'$ of $M$ and the structure
of $M$ away from $M'$ will be irrelevant. In particular, the hypothesis that the rank of $\pi|_M$
is locally constant will never be called upon at any point of $M \setminus M'$. Nevertheless, local
constancy of the rank of $\pi|_M$ near every point of $M$ is a requirement that cannot be
weakened, for it is needed in the first place to check whether any given point of $M$ is or is
not a point of $M'$!

As an example of a reduction consider the 3-dimensional $\pi$-submanifold $M = F^{-1}(0)$
of $T\mathbb{R}^3$ for the simple DAE (2.3). Obviously, its projection

$W = \pi(M) = \{x \in \mathbb{R}^3 : x_1 - \cos x_2 = 0\}$

is a 2-dimensional submanifold of $\mathbb{R}^3$ with tangent bundle

$TW = \{(x, p) \in T\mathbb{R}^3 : x_1 - \cos x_2 = 0,\ p_1 + p_2 \sin x_2 = 0\}.$

Thus the global projection of $M$ is here a manifold and the reduction $M'$ of $M$ coincides
with $TW \cap M$ and is globally characterized by the system of equations

$x_1 - \cos x_2 = 0, \quad p_1 + p_2 \sin x_2 = 0, \quad p_1 = x_3, \quad p_2 = 1.$

By substituting the third and fourth equation into the second equation it follows that
$M' = F_1^{-1}(0)$ where

$F_1 : T\mathbb{R}^3 \to \mathbb{R}^4, \qquad F_1(x, p) = \begin{pmatrix} x_1 - \cos x_2 \\ x_3 + \sin x_2 \\ p_1 - x_3 \\ p_2 - 1 \end{pmatrix}.$


Obviously we have $\operatorname{rank} DF_1(x, p) = 4$ and $\operatorname{rank} D_pF_1(x, p) = 2$ everywhere. Thus $M'$ is
a 2-dimensional submanifold of $T\mathbb{R}^3$ and, since Definition 3.1 (i) holds with $Y = W$ and
Theorem 2.2 applies, we see that $M'$ is again a $\pi$-submanifold.
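
The two rank statements can be read off directly from the Jacobian of the mapping $F_1$ of this example, with the columns ordered as $(x_1, x_2, x_3 \mid p_1, p_2, p_3)$. The following worked computation is a sketch added for illustration; the dimension and order counts use only Theorem 2.2 with $m = 3$, $q = 4$, $\rho = 2$.

```latex
DF_1(x, p) =
\left(\begin{array}{ccc|ccc}
1 & \sin x_2 & 0  & 0 & 0 & 0 \\
0 & \cos x_2 & 1  & 0 & 0 & 0 \\
0 & 0        & -1 & 1 & 0 & 0 \\
0 & 0        & 0  & 0 & 1 & 0
\end{array}\right),
\qquad
\begin{aligned}
\operatorname{rank} DF_1 &= 4 \quad \text{(columns 1, 3, 4, 5 are independent)}, \\
\operatorname{rank} D_pF_1 &= 2, \\
\dim M' &= 2m - q = 6 - 4 = 2, \\
\operatorname{ord}_{M'} &= m + \rho - q = 3 + 2 - 4 = 1 .
\end{aligned}
```

The order 1 agrees with the fact that the projection of $M'$ is the one-dimensional manifold $Y$ of (2.4).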

In general, the situation is not so simple and some additional conditions are needed to
ensure that the reduction is again a $\pi$-submanifold. We begin with a basic topological
property of $M'$.

Theorem 4.1. The reduction $M'$ of a $\pi$-submanifold $M$ is a closed subset of $M$.

Proof. We prove that $M \setminus M'$ is open in $M$, which is obvious if $M' = M$. Let $(x_0, p_0) \in
M \setminus M'$ and $E$ the connected component of $M$ containing that point. By assumption there
exists an open neighborhood $V$ of $(x_0, p_0)$ in $E$ such that $W = \pi(V)$ is a local projection
and hence $W$ is a submanifold of $X$ of dimension $\rho = \operatorname{ord}_M E$. Upon shrinking, if needed,
$V$ and hence also $W$ we may assume that there is a chart $(\Omega, \psi)$ of $X$ with domain $\Omega \supset W$
for which $\psi(\Omega)$ and $\psi(W)$ are open subsets of $\mathbb{R}^n$ and $\mathbb{R}^\rho$, respectively. Hence, by means of
the corresponding bundle chart, $T\Omega$ can be viewed as $\Omega \times \mathbb{R}^n$ and $TW$ as $W \times \mathbb{R}^\rho$. Under
this transformation $V$ becomes a submanifold of $\Omega \times \mathbb{R}^n$ which is projected onto $W$ and
the condition $p_0 \notin T_{x_0}W$ corresponding to $(x_0, p_0) \notin M'$ is now $p_0 \notin \mathbb{R}^\rho$. As a result we
have $p \notin \mathbb{R}^\rho$ and hence $(x, p) \notin TW$ for all $(x, p) \in V$ sufficiently close to $(x_0, p_0)$. This is
equivalent to $(x, p) \notin M'$ and the result follows. □

The following result motivates a condition under which the reduction $M'$ of a
$\pi$-submanifold does retain a differentiable structure:

Proposition 4.1. Let $M$ be a $\pi$-submanifold and $(x,p)$ a point in the reduction $M'$ of
$M$. Then for any local projection $W$ of $M$ at $(x,p)$ we have

(4.1)  $\dim\,[T_{(x,p)}TW \cap T_{(x,p)}M] \ge \operatorname{ord}_M(x,p).$


Proof. Let $\Xi$ be the connected component of $M$ containing a given point $(x_0,p_0) \in M'$
and $V$ an open neighborhood of the point in $\Xi$ such that $W = \pi(V)$ is a local projection
of $\Xi$ at that point. By Definition 3.1 (i) there exists a neighborhood $U$ of $(x_0,p_0)$ in $\Xi$
and a submanifold $Y$ of $X$ with $\dim Y = \dim \Xi$ such that $U \subset TY$. Without loss of
generality, we may assume that $V \subset U$ whence $W \subset \pi(U) \subset Y$. Since $W$ and $Y$ are both
submanifolds of $X$ we see that $W$ is a submanifold of $Y$ and hence that $T_xW$ is a subspace
of $T_xY$ for any $x \in W$. This shows that $TW$ is a subset of the bundle

(4.2)  $\mathcal{E} = \bigcup_{x \in W} [\{x\} \times T_xY].$

In fact, $\mathcal{E}$ is just the pull-back bundle of $TY$ under the canonical embedding $W \subset Y$.
Moreover $\mathcal{E}$ is a submanifold of $TX$. Indeed, let $(y,q) \in \mathcal{E}$; that is, $y \in W$ and $q \in T_yY$.
Since $W$ and $Y$ are submanifolds of $X$ it follows that, locally near $y$, we have $W = g^{-1}(0)$
and $Y = h^{-1}(0)$ for some submersions $g : X \to \mathbb{R}^{n-\rho}$ and $h : X \to \mathbb{R}^{n-m}$ where $\rho = \operatorname{ord}_M \Xi$
and $m = \dim Y = \dim \Xi$. Using a chart $[\Omega, \Psi]$ of $X$ near $y$ and the corresponding bundle
chart $[T\Omega, T\Psi]$ of $TX$ near $(y,q)$ we may assume $X = \mathbb{R}^n$. Thus in some neighborhood of
$(y,q)$, $\mathcal{E}$ coincides with the zero set of the mapping

$(x,p) \in T\mathbb{R}^n \mapsto (g(x), Dh(x)p) \in \mathbb{R}^{n-\rho} \times \mathbb{R}^{n-m}.$

It is easily checked that this mapping is a submersion at $(y,q)$ whence $\mathcal{E}$ is a
$(\rho + m)$-dimensional submanifold of $T\mathbb{R}^n$. Hence, $TW$ is not only a subset but a submanifold of
$\mathcal{E}$. Moreover, from $V \subset U \subset TY$ and $(x,p) \in V$ it follows that $x \in W$ and $p \in T_xY$. In
other words, we have $V \subset \mathcal{E}$ and thus $V$ is also a submanifold of $\mathcal{E}$. Therefore, (4.1) is a
direct consequence of $\dim TW = 2\rho$, $\dim V = \dim \Xi$, and $\dim \mathcal{E} = \rho + \dim \Xi$. □
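The last step of the proof is the standard dimension count for two subspaces of a common ambient space. As a sketch, writing $\rho$ for the order, $\Xi$ for the connected component and $\mathcal{E}$ for the bundle (4.2) (notation assumed here to match the proof), and using that $T_{(x,p)}V = T_{(x,p)}M$ because $V$ is open in $M$:

```latex
\begin{aligned}
\dim\bigl[T_{(x,p)}TW \cap T_{(x,p)}V\bigr]
  &\ge \dim T_{(x,p)}TW + \dim T_{(x,p)}V - \dim T_{(x,p)}\mathcal{E} \\
  &= 2\rho + \dim\Xi - (\rho + \dim\Xi) \\
  &= \rho = \operatorname{ord}_M(x,p).
\end{aligned}
```

Equality holds exactly when the two subspaces together span $T_{(x,p)}\mathcal{E}$, which is the transversality situation.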

Since generically two subspaces of dimensions $2\rho$ and $m$, respectively, of a
$(\rho + m)$-dimensional space intersect along a $\rho$-dimensional subspace, Proposition 4.1 indicates that
equality in (4.1) should be expected in most cases. This partly justifies the following
concept:

Definition 4.2. Let $X$ be a given manifold. A $\pi$-submanifold $M$ of $TX$ is reducible if
for every point $(x,p)$ of the reduction $M'$ of $M$ there exists a local projection $W$ of $M$ at
$(x,p)$ such that

(4.3)  $\dim \mathcal{T}_{(x,p)} = \operatorname{ord}_M(x,p), \qquad \mathcal{T}_{(x,p)} = T_{(x,p)}TW \cap T_{(x,p)}M$

and if on some neighborhood $U$ of $(x,p)$ in $TW \cap M$

(4.4)  $\operatorname{rank}\, \pi|_{\mathcal{T}_{(y,q)}} = \text{constant}, \quad \forall (y,q) \in U.$

Obviously  (4.3)  is  independent  of  the  local  projection  W  and  this  is  essentially  also  true 
of  (4.4)  if  we  confine  attention  to  sufficiently  small  neighborhoods  of  the  point. 

The  term  reducible  in  the  above  definition  is  justified  by  the  following  result: 

Theorem 4.2. Let $M$ be a reducible $\pi$-submanifold. Then, the reduction $M'$ of $M$ is
either empty or a closed submanifold of $M$ and a $\pi$-submanifold of $TX$. Moreover for any
$(x,p) \in M'$ the dimension of the connected component of $M'$ containing $(x,p)$ equals the
order of $(x,p)$ in $M$ and for every local projection $W = \pi(V)$ of $M$ at $(x,p)$ we have

(4.5)  $T_{(x,p)}M' = T_{(x,p)}(TW \cap V) = T_{(x,p)}TW \cap T_{(x,p)}M.$

Proof. The closedness of $M'$ in $M$ was shown in Theorem 4.1. If $W = \pi(V)$ is a local
projection of $M$ at the given point $(x,p) \in M'$ then the condition (4.3) expresses the fact
that the submanifolds $TW$ and $V$ of the bundle $\mathcal{E}$ in (4.2) intersect transversally at $(x,p)$
since $T_{(x,p)}M = T_{(x,p)}V$. This implies that $TW \cap V$ is a $\rho$-dimensional submanifold of
$TW$ and $M$ and also that the second equality in (4.5) holds. On the other hand, since
$W = \pi(V)$ is a local projection of $M$ not only at $(x,p)$ but also at any other point of $V$
and since $V$ can be chosen so small that the rank of $\pi|_V$ is constant on all of $V$, we see that
$M'$ and $TW \cap V$ must coincide. This implies that $M'$ is a submanifold of $M$ and that the
first equality in (4.5) holds. Moreover, the dimension of the connected component of $M'$
containing $(x,p)$ has to equal $\dim T_{(x,p)}M'$ which, by (4.3) and (4.5), is $\rho = \operatorname{ord}_M(x,p)$.

Since $M'$ and $TW \cap V$ coincide there exists an open connected neighborhood $U'$ of $(x,p)$
in $M'$ contained in $TW$. Hence we have $U' \subset \Xi'$ where $\Xi'$ is the connected component of
$M'$ containing $(x,p)$. Now for $Y' = W$ it follows from the relation $\dim \Xi' = \rho = \dim Y'$
that the pair $(U',Y')$ satisfies the conditions required of the pair $(U,Y)$ in Definition 3.1
(i). Finally, because of (4.5), the relation (4.4) implies that $\pi|_{M'}$ has constant rank near
$(x,p)$ and hence that $M'$ is a $\pi$-submanifold. □

From Theorem 4.2 and its proof, condition (4.3) in Definition 4.2 represents a
transversality condition for the intersection $TW \cap M$ in an appropriate local setting and the
condition (4.4) ensures that $\pi|_{M'}$ has locally constant rank. As observed earlier, transversality
is not true, in general, in the 'natural' ambient manifold $TX$.

With  this  we  can  reformulate  Theorem  3.1  as  follows: 

Theorem 4.3. Let $M$ be a reducible $\pi$-submanifold of $TX$ for which the reduction $M'$ is
identical with $M$. Then for every point $(x,p) \in M$ there exists a local projection $W = \pi(V)$
of $M$ at this point and a section $\varphi : W \to TW$ such that $V = \varphi(W)$.

Proof. The connected component $\Xi$ of $M$ containing $(x,p)$ is also the connected component
of $M'$ containing $(x,p)$ whence by Theorem 4.2 the dimension of $\Xi$ equals the order of $(x,p)$
in $M$ and hence

$\dim \Xi = \operatorname{ord}_M(x,p) = \operatorname{ord}_M \Xi.$

Thus the dimension of every connected component $\Xi$ of $M$ is equal to the order of $\Xi$ in $M$
and the result follows from Theorem 3.1. □

We end this section with a result characterizing reducible $\pi$-submanifolds that coincide
with their reduction.

Proposition 4.2. Let $M$ be a submanifold of $TX$ and suppose that for every $(x,p) \in M$
there exists an open neighborhood $U$ in the connected component $\Xi$ of $M$ containing $(x,p)$
and a submanifold $Y$ of $X$ with $\dim Y = \dim \Xi$ such that $U \subset TY$. Then, $M$ is a
reducible $\pi$-submanifold with $M' = M$ if and only if $T_{(x,p)}\pi$ is a linear isomorphism from
$T_{(x,p)}M$ to $T_xY$ irrespective of the choice of $(x,p) \in M$.

Proof. The necessity is obvious and we prove only the sufficiency. If the isomorphism
condition holds then $\pi|_\Xi : \Xi \to Y$ has maximum rank at $(x,p)$ and hence is a local
diffeomorphism on some neighborhood of that point. In particular, $\pi|_\Xi$ has maximum
rank at all points of $M$ near $(x,p)$ so that condition (ii) of Definition 3.1 holds. Moreover,
a local projection $W = \pi(V)$ of $M$ at $(x,p)$ is an open neighborhood of $x$ in $Y$ whence
$TW = \pi^{-1}(W)$ certainly contains $(x,p)$. In other words, every point of $M$ is a point of
reducibility of $M$; that is, $M' = M$. The conditions (4.3) and (4.4) require here that
$\dim T_{(x,p)}M = \dim Y$ and that the rank of $\pi|_{T_{(y,q)}M}$ is constant for all $(y,q)$ in some
neighborhood of $(x,p)$. But these relations have already been proved above and hence the
result follows. □


5. Completely reducible $\pi$-submanifolds.

As before, let $X$ be an $n$-dimensional manifold and $M$ a reducible $\pi$-submanifold of
$TX$. Then by Theorem 4.2 the reduction $M'$ of $M$ is either empty or a $\pi$-submanifold of
$TX$. Of course, if $M' \ne \emptyset$ we do not know whether $M'$ is again reducible, but if that is the
case, then the reduction $M_2 = M_1'$ of $M_1 = M'$ is again either empty or a $\pi$-submanifold
of $TX$. This suggests the following recursive definition:

Definition 5.1. The reducible $\pi$-submanifold $M = M_0$ of $TX$ is completely reducible if
for every index $j \ge 0$ such that $M_j \ne \emptyset$ the reduction $M_{j+1}$ of $M_j$ is either empty or a
reducible $\pi$-submanifold. For $M_j = \emptyset$ we set $M_{j+1} = \emptyset$. Then the sequence $\{M_j\}_{j \ge 0}$ is
well-defined and called the reduction chain of $M$.

Let $M$ be a completely reducible $\pi$-submanifold of $TX$ with reduction chain $\{M_j\}_{j \ge 0}$.
If $M_{k+1} = M_k$ for some index $k \ge 0$ then it is obvious that $M_j = M_k$ for $j \ge k$;
that is, the reduction chain becomes stationary. Since reduction never increases the local
dimension near any point $(x,p)$ it is intuitively evident that every reduction chain should
become stationary. The following result proves this fact and introduces rigorously the
index concept given by S. Reich in [12], [13].

Theorem 5.1. Let $M$ be a completely reducible $\pi$-submanifold of $TX$ with $\dim M = m$
and reduction chain $\{M_j\}_{j \ge 0}$. For any non-empty, connected component $\Xi_{m+1}$ of $M_{m+1}$
define $\Xi_j$, $j = m, m-1, \ldots, 0$, recursively as the connected component of $M_j$ containing
$\Xi_{j+1}$. Then there exists a smallest integer $\nu$, $0 \le \nu \le \dim \Xi_0 \le m$, the index of $\Xi_{m+1}$ in
$M$, such that

(5.1)  $\Xi_j' = \Xi_j, \quad j = \nu, \nu+1, \ldots, m+1.$

In particular, the reduction chain of $M$ always satisfies

(5.2)  $M_{m+2} = M_{m+1}.$


Proof. For $M_{m+1} = \emptyset$ the first part of the theorem is vacuous and (5.2) is obvious. Suppose
therefore that there exists a non-empty connected component $\Xi_{m+1}$ of $M_{m+1}$. Then the
connected components $\Xi_j \subset M_j$ constructed in the theorem satisfy $\emptyset \ne \Xi_{m+1} \subset \Xi_{j+1} \subset \Xi_j$
for $j = 0, \ldots, m$. Evidently, the reduction $\Xi_j'$ of $\Xi_j$ equals $\Xi_j' = M_{j+1} \cap \Xi_j$ and, since $\Xi_{j+1}$
is a connected component of $M_{j+1}$, it is also a connected component of $\Xi_j' = M_{j+1} \cap \Xi_j$.
Thus we have

(5.3)  $\Xi_{j+1} \subset \Xi_j' = M_{j+1} \cap \Xi_j \subset \Xi_j, \quad j = 0, \ldots, m,$

and, because connected components of closed sets are closed, it follows by Theorem 4.2
that $\Xi_j$ is closed in $M$ for all $j$. In particular, $\Xi_{j+1}$ is closed in $\Xi_j$. Now $\Xi_j \ne \emptyset$ ensures
that $\mu_j = \dim \Xi_j$ is defined for $j = 0, \ldots, m+1$ and (5.3) implies that

(5.4)  $0 \le \mu_{m+1} \le \mu_m \le \cdots \le \mu_0 \le m.$

The last inequality follows from the fact that $\Xi_0$ is a connected component of $M$ and hence
has dimension at most $m$. Since (5.4) lists $m+2$ integers taking at most $m+1$ distinct values,
two of them must be equal and there exists a smallest index $\nu$ among $0 \le \nu \le m$ for which
$\mu_{\nu+1} = \mu_\nu$. This implies
that $\Xi_{\nu+1}$ is an open submanifold of $\Xi_\nu$ and, since it is also a nonempty and closed subset
of $\Xi_\nu$ and $\Xi_\nu$ is connected, that $\Xi_\nu = \Xi_{\nu+1}$. But then (5.3) shows that $\Xi_{\nu+1} = \Xi_\nu' = \Xi_\nu$;
that is, reduction does not affect $\Xi_\nu$ and, of course, neither will repeated reduction. In
other words, we must have $\Xi_j' = \Xi_j$ for $j = \nu, \nu+1, \ldots, m+1$ which is (5.1). For $j = m+1$
it follows that for any nonempty connected component $\Xi_{m+1}$ of $M_{m+1}$ the reduction $\Xi_{m+1}'$
is equal to $\Xi_{m+1}$ which proves (5.2). □
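The counting step of this proof (a non-increasing list of $m+2$ integers between $0$ and $m$ must repeat a value) is easy to illustrate. The sketch below is an illustration only: the function name `index_from_dimensions` and the sample list are hypothetical, and the code captures just the pigeonhole argument on the dimensions, not the topological step that turns equal dimensions into equal components.

```python
def index_from_dimensions(dims):
    """Smallest nu with dims[nu + 1] == dims[nu] for a non-increasing list of dimensions."""
    if any(b > a for a, b in zip(dims, dims[1:])):
        raise ValueError("dimensions must be non-increasing along the chain")
    for nu, (a, b) in enumerate(zip(dims, dims[1:])):
        if a == b:
            return nu
    raise ValueError("no repeated dimension: the list has fewer than m + 2 entries")

# Hypothetical chain with m = 5: dimensions of the components Xi_0, ..., Xi_6.
dims = [5, 4, 3, 2, 2, 2, 2]
nu = index_from_dimensions(dims)
print(nu)
```

For the sample list the first repetition occurs between positions 3 and 4, so the sketch reports the value 3.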

Definition 5.2. For the completely reducible $\pi$-submanifold $M$ with reduction chain
$\{M_j\}_{j \ge 0}$ the core $C(M)$ of $M$ is the intersection $\bigcap_{j \ge 0} M_j$.

Hence, for any completely reducible $\pi$-submanifold $M$ of $TX$ Theorem 5.1 implies that
$C(M) = M_{m+1}$ with $m = \dim M$ and that $C(M)$ is either empty or a (completely)
reducible $\pi$-submanifold with $C(M)' = C(M)$. Thus Theorems 4.2 and 4.3 at once provide
the following result:

Theorem 5.2. Let $M$ be a completely reducible $\pi$-submanifold. Then, the core $C(M)$
of $M$ is either empty or a closed submanifold of $M$ and, moreover, $C(M)$ is a reducible
$\pi$-submanifold equal to its reduction. In particular, for every $(x,p) \in C(M)$ there exists
a local projection $W = \pi(V)$ of $C(M)$ at $(x,p)$ and a section $\varphi : W \to TW$ such that
$V = \varphi(W)$.

Theorem 5.1 defines the index of any nonempty connected component $\Xi$ of $C(M)$ and
shows that this index is at most equal to the dimension $\dim \Xi_0$ of the connected component
of $M$ containing $\Xi$. Hence, for any point $(x,p) \in C(M)$ the index may be defined as the
index of the connected component of $C(M)$ containing $(x,p)$. Then, again, the index of
$(x,p)$ never exceeds the dimension of the connected component of $M$ containing the point.

At first sight, Theorem 5.1 may appear to imply that the relation (5.2) could be
improved to $M_{m+1} = M_m$. This is not the case, for it may happen that the reduction $\Xi_m'$
of some connected component $\Xi_m$ of $M_m$ is empty. All we can say is that when $\Xi_m' \ne \emptyset$
then $\Xi_m' = \Xi_m$ which is weaker.

More generally, note that any open subset of a $\pi$-submanifold $M$ obviously is a
$\pi$-submanifold with the same dimension as $M$. This fact was used implicitly in the proof of
Theorem 5.1. Thus, if a point $(x,p) \in M$ has an open neighborhood $U$ that is a completely
reducible $\pi$-submanifold and for which $(x,p)$ belongs to the core $C(U)$ of $U$, then we may
define a local index of $(x,p)$ as the index of $(x,p)$ in $C(U)$. It is easily checked that
shrinking $U$ has no effect on this definition and that when $(x,p)$ has a local index and $M$
is completely reducible, then $(x,p) \in C(M)$ and the index and local index of the point
coincide. Thus, the concept of a local index is useful only when not all of $M$ is completely
reducible. It is equally straightforward to verify that if each point of $M$ is contained in a
completely reducible neighborhood, then $M$ is completely reducible.

6. Application to Differential-Algebraic Equations.

As an application of our theory we consider now an implicit differential equation

(6.1)  $F(x(t), \dot{x}(t)) = 0,$

where $\dot{x}$ again stands for $dx/dt$. Here $F : \mathbb{R}^n \times \mathbb{R}^n \to \mathbb{R}^n$ is assumed to be a $C^\infty$-mapping
and we recall our agreement that $F$ may only be defined on an open subset of $\mathbb{R}^n \times \mathbb{R}^n$.
A $C^1$-solution of (6.1) is a $C^1$ function

(6.2)  $x : J \to \mathbb{R}^n, \quad J \subset \mathbb{R}$ open interval,

such that (6.1) holds for all $t \in J$.

As  indicated  at  the  end  of  the  Introduction  the  geometrical  theory  allows  us  to  weaken 
the  definition  of  an  implicit  differential-algebraic  equation: 

Definition 6.1. The implicit differential equation (6.1) is a nonsingular
differential-algebraic equation (DAE) if $M = F^{-1}(0)$ is a completely reducible $\pi$-submanifold of
$\mathbb{R}^n \times \mathbb{R}^n = T\mathbb{R}^n$.

Then  we  obtain  the  following  basic  existence  and  uniqueness  result: 

Theorem 6.1. Let (6.1) be a nonsingular DAE and denote the core of $M = F^{-1}(0)$ by
$C(M)$.

(i) Any $C^1$-solution (6.2) of (6.1) satisfies $(x(t), \dot{x}(t)) \in C(M)$ for all $t \in J$.

(ii) Conversely, for any $(x_0,p_0) \in C(M)$ there exists a local projection $W = \pi(V)$ of $C(M)$
at $(x_0,p_0)$ and a section $\varphi : W \to TW$ such that for any $C^1$ function (6.2) we have

(6.3)  $(x(t), \dot{x}(t)) \in V \iff \{x(t) \in W \text{ and } (x(t), \dot{x}(t)) = \varphi(x(t))\}.$

Hence, locally near any $(x_0,p_0) \in C(M)$, (6.1) is equivalent to an explicit ODE on $W$.
In particular, for any $t_0 \in \mathbb{R}$ there exists an open interval $J$ containing $t_0$ and a unique
$C^1$-solution (6.2) of (6.1) such that $(x(t_0), \dot{x}(t_0)) = (x_0,p_0)$. Moreover, $x$ is of class $C^\infty$.

Proof. For the proof of (i) let $x$ be any $C^1$-solution (6.2) of (6.1) and for any fixed $t_0 \in J$ set
$(x(t_0), \dot{x}(t_0)) = (x_0,p_0)$. Then there exists a local projection $W = \pi(V)$ of $M$ at $(x_0,p_0)$
and obviously, for all $t \in J$ near $t_0$ we have $x(t) \in W$ and therefore $(x(t), \dot{x}(t)) \in TW$;
that is, $(x(t), \dot{x}(t)) \in TW \cap M$. But $TW \cap M$ coincides with the reduction $M_1$ of $M$ near
$(x_0,p_0)$ so that $(x(t), \dot{x}(t)) \in M_1$ for $t$ near $t_0$ and hence for all $t \in J$ since $t_0$ was arbitrary.
Let $M_j$, $j = 0, 1, \ldots$ denote the reduction chain of $M = M_0$. By proceeding inductively,
we then find that $(x(t), \dot{x}(t)) \in M_j$ for $j = 0, 1, \ldots$ and all $t \in J$. Thus, in particular,
$(x(t), \dot{x}(t)) \in C(M)$ since, by definition, $C(M) = \bigcap_j M_j$. This proves (i).

For the proof of the first part of (ii), recall that by Theorem 5.2 there exists a local
projection $W = \pi(V)$ of $C(M)$ at $(x_0,p_0)$ and a section $\varphi : W \to TW$ such that

$(x,p) \in V \iff \{x \in W \text{ and } (x,p) = \varphi(x)\}.$

This proves (6.3). Moreover, since $\varphi$ is a section of $TW$ there exists an open interval $J$ and a
unique $C^1$ function $x : J \to W$, of class $C^\infty$, for which $x(t_0) = x_0$ and $(x(t), \dot{x}(t)) = \varphi(x(t))$
for all $t \in J$. Here $\dot{x}(t)$ denotes $T_t x \cdot 1$, but, because $W$ is a submanifold of $\mathbb{R}^n$, this is
just the usual derivative $(dx/dt)(t)$. Hence we have $(x(t), \dot{x}(t)) \in V \subset C(M) \subset M$, so that
$F(x(t), \dot{x}(t)) = 0$ for all $t \in J$. In other words, $x$ is a $C^1$-solution of (6.1) of class $C^\infty$. The
uniqueness follows from part (i) and (6.3), and the uniqueness of the constructed solution
$x$ as an integral curve of $\varphi$ since, necessarily, (6.3) must hold for $t$ in some open subinterval
of $J$ around $t_0$. □

By  using  arguments  closely  related  to  those  in  the  proof  of  Theorem  3.1  of  [11],  we  can 
formulate  also  global  results  for  the  solutions  of  (6.1): 

Theorem 6.2. Let (6.1) be a nonsingular DAE. Then any $C^1$-solution$^3$ (6.2) of (6.1) can
be extended to a $C^1$-solution of (6.1) on an interval $(a,b)$ that is maximal under set
inclusion. Moreover at the endpoints this solution has the properties:

(i) If $b < \infty$ (or $a > -\infty$) and $\dot{x}(t)$ is bounded for $t \in (a,b)$ near $b$ (or $a$), then
$\lim_{t \to b-} x(t) = x_0$ (or $\lim_{t \to a+} x(t) = x_0$) exists.

(ii) If $b < \infty$ (or $a > -\infty$) and $\lim_{t \to b-} x(t) = x_0$ (or $\lim_{t \to a+} x(t) = x_0$) exists, then
$\lim_{t \to b-} |\dot{x}(t)| = \infty$ (or $\lim_{t \to a+} |\dot{x}(t)| = \infty$).

As a result, if $b < \infty$ (or $a > -\infty$), then $\dot{x}(t)$ is unbounded for $t \in (a,b)$ near $b$ (or $a$).

Since the proof follows very closely that of Theorem 3.1 in [11] we give here only a brief
sketch. By Theorem 6.1 (ii) any two solutions $x_1$ and $x_2$ of (6.1) defined in open intervals $J_1$ and
$J_2$, respectively, with $(x_1(t_0), \dot{x}_1(t_0)) = (x_2(t_0), \dot{x}_2(t_0))$ for some $t_0 \in J_1 \cap J_2$ must coincide
in $J_1 \cap J_2$. Hence they continue each other in $J_1 \cup J_2$ as a solution of (6.1). In turn,
this argument yields, for instance by Zorn's lemma, the existence of a maximal interval of
definition $(a,b)$ for any $C^1$-solution $x$ of (6.1).

$^3$ Note that by Theorem 6.1 any $C^1$-solution (6.2) of (6.1) is necessarily of class $C^\infty$.

Now part (i) of the statement follows readily with the help of the integral mean value
theorem $x(t) - x(s) = \int_s^t \dot{x}(\tau)\,d\tau$. For part (ii) we argue by contradiction. Suppose
that $b < \infty$ and that there is a sequence $t_k \in (a,b)$, $\lim_{k \to \infty} t_k = b$, such that $\dot{x}(t_k)$ is
bounded. By extracting a subsequence we find a $p_0 \in \mathbb{R}^n$ such that $\lim_{k \to \infty} \dot{x}(t_k) = p_0$
whence $\lim_{k \to \infty} (x(t_k), \dot{x}(t_k)) = (x_0,p_0)$. Note that $(x_0,p_0) \in C(M)$ since, by Theorem 5.2,
$C(M)$ is closed in $M = F^{-1}(0)$ and $M$ is closed in $\mathbb{R}^n \times \mathbb{R}^n$. Moreover, the same theorem
ensures that $C(M)$ coincides with the graph of a section $\varphi : W \to TW$ in a neighborhood
of $(x_0,p_0)$ where $W = \pi(V)$ is a local projection of $C(M)$ at $(x_0,p_0)$. Thus it follows that
for $p \in \mathbb{R}^n$, sufficiently close to $p_0$, we have

(6.4)  $(x_0,p) \in C(M) \implies p = p_0.$

Consider now any other sequence $s_k \in (a,b)$ such that $\lim_{k \to \infty} s_k = b$. By extracting a
subsequence we may assume either that $\lim_{k \to \infty} |\dot{x}(s_k)| = \infty$ or that $\lim_{k \to \infty} \dot{x}(s_k) = \tilde{p}_0 \in \mathbb{R}^n$.
In the first case, as well as in the second case for $\tilde{p}_0 \ne p_0$, the same arguments as in the
proof of Theorem 3.1 of [11] can be used, together with the closedness of $C(M)$, to show
that for every sufficiently small $\delta > 0$ there exists a $p_\delta \in \mathbb{R}^n$ such that $(x_0,p_\delta) \in C(M)$
and $|p_0 - p_\delta| = \delta$. But this contradicts (6.4) and thus we necessarily have $\lim_{t \to b-} \dot{x}(t) = p_0$.

Altogether we find that $\lim_{t \to b-} (x(t), \dot{x}(t)) = (x_0,p_0)$. Moreover, using once again the fact
that $C(M)$ is the graph of the section $\varphi : W \to TW$ near $(x_0,p_0)$, we see that $(x(t), \dot{x}(t))$
belongs to $V$ for all $t \in (a,b)$ sufficiently close to $b$. Hence $x$ coincides with the unique
solution $\tilde{x}$ of the initial value problem

$(\tilde{x}(t), \dot{\tilde{x}}(t)) = \varphi(\tilde{x}(t)), \quad (\tilde{x}(b), \dot{\tilde{x}}(b)) = (x_0,p_0).$

But then $\tilde{x}$ extends $x$ beyond $b$ as a solution of (6.1), in contradiction with the maximality
of $(a,b)$. The case when $t \to a+$ in (ii) of the theorem can be handled in the exact same
way. □

Let $x$ denote a $C^1$-solution (6.2) of a nonsingular DAE (6.1). By Theorem 6.1 (i) we
then have $(x(t), \dot{x}(t)) \in C(M)$ for all $t \in J$ where $M = F^{-1}(0)$ and all points $(x(t), \dot{x}(t))$
for $t \in J$ must belong to the same connected component $\Xi$ of $C(M)$. Theorem 5.1 defines
the index of any such component $\Xi$ and it is natural to call it the index of the solution
$x$. Thus, for the DAE (6.1) the index of a solution $x$ is simply the number of successive
reductions of $M = F^{-1}(0)$ needed to obtain $x$ (locally) as a solution of an explicit ODE.
Note that the index of a solution is not a local concept at a point of $x$ but, in fact, is
independent of the point on the trajectory.

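Nonautonomous problems of the form $F(t, x(t), \dot{x}(t)) = 0$ can easily be included in this
theory. In fact, after using again the trick of adding the scalar equation $\dot{t} = 1$, we may
apply the results to the autonomous equation $\mathbf{F}(\mathbf{x}(\tau), \dot{\mathbf{x}}(\tau)) = 0$ with

$\mathbf{F} : \mathbb{R}^{n+1} \times \mathbb{R}^{n+1} \to \mathbb{R}^{n+1}, \quad \mathbf{F}(\mathbf{x},\mathbf{p}) = (s - 1, F(t,x,p)), \quad \mathbf{x} = (t,x), \quad \mathbf{p} = (s,p).$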
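The augmentation described above is mechanical and can be sketched in a few lines. This is only an illustration under stated assumptions: the wrapper name `autonomize`, the list-based calling convention and the scalar test equation are hypothetical and not part of the original text.

```python
def autonomize(F):
    """Wrap F(t, x, p) (returning n residuals) into the autonomous map on R^(n+1) x R^(n+1)."""
    def F_aug(x_aug, p_aug):
        t, x = x_aug[0], list(x_aug[1:])   # augmented state (t, x)
        s, p = p_aug[0], list(p_aug[1:])   # augmented velocity (s, p)
        # The first residual enforces dt/dtau = 1; the rest are the original equations.
        return [s - 1.0] + list(F(t, x, p))
    return F_aug

# Hypothetical scalar example: F(t, x, p) = p - t, i.e. dx/dt = t.
F = lambda t, x, p: [p[0] - t]
F_aug = autonomize(F)

# Along the solution x(t) = t^2 / 2 we have (t, x) = (2, 2) and (dt/dt, dx/dt) = (1, 2).
residual = F_aug([2.0, 2.0], [1.0, 2.0])
print(residual)
```

Both residuals vanish at the sample point, while any choice $s \ne 1$ makes the first component nonzero, which is how the extra equation pins the new independent variable to $t$.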

We  conclude  this  section  with  the  simple  example  of  a  planar  pendulum.  Of  course, 
our  interest  does  not  center  on  the  particular  application  but,  instead,  on  illuminating  the 
details  of  the  reduction  process.  This  example  was  chosen  for  its  simple  form  which  permits 
an  explicit  manipulation  of  all  equations  arising  in  the  reduction  process  and  allows  for  a 
direct identification of the entire chain. Moreover, the reduction chain turns out to end a
fortiori with a system equivalent to the classical 'pendulum equation' which therefore is
recovered here without any guess of the proper variables.

Suppose a mass $m$ is attached at the end of a rigid massless wire of length $\ell$ hanging
from the origin. If $\lambda$ is the tension of the wire and $g$ the gravity constant then the problem
may be modelled by the second order DAE

(6.5)  $m\ddot{x}_1 + \lambda x_1 = 0, \quad m\ddot{x}_2 + \lambda x_2 + mg = 0, \quad x_1^2 + x_2^2 - \ell^2 = 0,$


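which with $x_3 = \dot{x}_1$, $x_4 = \dot{x}_2$, and $x_5 = \lambda/m$ transforms into the form (6.1) where

(6.6)  $F : T\mathbb{R}^5 \to \mathbb{R}^5, \quad F(x,\dot{x}) = \begin{pmatrix} x_1^2 + x_2^2 - \ell^2 \\ \dot{x}_1 - x_3 \\ \dot{x}_2 - x_4 \\ \dot{x}_3 + x_1 x_5 \\ \dot{x}_4 + x_2 x_5 + g \end{pmatrix}.$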

Clearly rank $DF(x,p) = 5$ and rank $D_pF(x,p) = 4$ on $M = F^{-1}(0)$ and hence $M$ is a
pure 5-dimensional submanifold and, by Theorem 2.2, also a $\pi$-submanifold of $T\mathbb{R}^5$.

The projection of $M$ is $W = \pi(M) = \{x \in \mathbb{R}^5 : x_1^2 + x_2^2 - \ell^2 = 0\}$ and hence is a
4-dimensional submanifold of $\mathbb{R}^5$ with tangent bundle

$TW = \{(x,p) \in T\mathbb{R}^5 : x_1^2 + x_2^2 - \ell^2 = 0, \; x_1p_1 + x_2p_2 = 0\}.$


Since the global projection of $M$ is a manifold all local arguments are here of a global
nature and the reduction $M_1$ of $M$ coincides with $TW \cap M$ and is characterized by the
equations $x_1^2 + x_2^2 - \ell^2 = 0$, $x_1p_1 + x_2p_2 = 0$ coupled with the system

(6.7)  $p_1 - x_3 = 0, \quad p_2 - x_4 = 0, \quad p_3 + x_1x_5 = 0, \quad p_4 + x_2x_5 + g = 0.$

With the first and second equations of (6.7), $x_1p_1 + x_2p_2 = 0$ becomes $x_1x_3 + x_2x_4 = 0$
and hence we find that $M_1 = F_1^{-1}(0)$ where

$F_1 : T\mathbb{R}^5 \to \mathbb{R}^6, \quad F_1(x,p) = \begin{pmatrix} x_1^2 + x_2^2 - \ell^2 \\ x_1x_3 + x_2x_4 \\ p_1 - x_3 \\ p_2 - x_4 \\ p_3 + x_1x_5 \\ p_4 + x_2x_5 + g \end{pmatrix}.$

Here $DF_1(x,p)$ has full rank 6 at every point of $F_1^{-1}(0)$ and hence $M_1$ is a pure
4-dimensional submanifold of $T\mathbb{R}^5$. Moreover, Definition 3.1 (i) holds with $Y = W$ and
by Theorem 2.2 it follows that $M_1$ is a $\pi$-submanifold of $T\mathbb{R}^5$. The projection of $M_1$ is the
pure 3-dimensional submanifold


$W_1 = \pi(M_1) = \{x \in \mathbb{R}^5 : x_1^2 + x_2^2 - \ell^2 = 0, \; x_1x_3 + x_2x_4 = 0\}$

of $\mathbb{R}^5$ which has the tangent bundle

$TW_1 = \{(x,p) \in T\mathbb{R}^5 : x_1^2 + x_2^2 - \ell^2 = 0, \; x_1x_3 + x_2x_4 = 0, \; x_1p_1 + x_2p_2 = 0, \; x_3p_1 + x_4p_2 + x_1p_3 + x_2p_4 = 0\}.$

Hence we see that $(x,p) \in M_2 = TW_1 \cap M_1$ exactly if $(x,p)$ satisfies the system of
differential equations (6.7) now coupled with the equations

(6.8)
$x_1^2 + x_2^2 - \ell^2 = 0,$
$x_1x_3 + x_2x_4 = 0,$
$x_1p_1 + x_2p_2 = 0,$
$x_3p_1 + x_4p_2 + x_1p_3 + x_2p_4 = 0,$

and again $M_2$ turns out to be the reduction of $M_1$.


By using (6.7) we can express $p_1, p_2, p_3, p_4$ in all four equations (6.8) in terms of $x$. Then
the third equation is the same as the second one and can be dropped. Thus we see that
$M_2 = F_2^{-1}(0)$ with

$F_2 : T\mathbb{R}^5 \to \mathbb{R}^7, \quad F_2(x,p) = \begin{pmatrix} x_1^2 + x_2^2 - \ell^2 \\ x_1x_3 + x_2x_4 \\ gx_2 - x_3^2 - x_4^2 + \ell^2 x_5 \\ p_1 - x_3 \\ p_2 - x_4 \\ p_3 + x_1x_5 \\ p_4 + x_2x_5 + g \end{pmatrix},$


and a trivial verification shows that $DF_2(x,p)$ has full rank 7 on $F_2^{-1}(0)$ while $D_pF_2(x,p)$
has constant rank 4 everywhere. Therefore, in the same manner as before we conclude
that $M_2$ is a pure 3-dimensional $\pi$-submanifold of $T\mathbb{R}^5$.
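For the reader who wants to see where the third component of $F_2$ comes from, the substitution of (6.7) into the last two equations of (6.8) can be written out as follows. This is a worked sketch of the elimination step, using only the equations stated above:

```latex
\begin{aligned}
x_1p_1 + x_2p_2 &= x_1x_3 + x_2x_4, \\
x_3p_1 + x_4p_2 + x_1p_3 + x_2p_4
  &= x_3^2 + x_4^2 - x_1^2x_5 - x_2(x_2x_5 + g) \\
  &= x_3^2 + x_4^2 - (x_1^2 + x_2^2)\,x_5 - gx_2 \\
  &= -\bigl(gx_2 - x_3^2 - x_4^2 + \ell^2 x_5\bigr)
     \quad\text{when } x_1^2 + x_2^2 = \ell^2 .
\end{aligned}
```

The first line reproduces the second equation of (6.8), which is why it can be dropped, and the last line is, up to sign, the new algebraic constraint.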

A further step uses the projection $W_2 = \pi(M_2)$ of $M_2$; that is,

$W_2 = \{x \in \mathbb{R}^5 : x_1^2 + x_2^2 - \ell^2 = 0, \; x_1x_3 + x_2x_4 = 0, \; gx_2 - x_3^2 - x_4^2 + \ell^2 x_5 = 0\},$

which is a pure 2-dimensional submanifold of $\mathbb{R}^5$ with the tangent bundle

$TW_2 = \{(x,p) \in T\mathbb{R}^5 : x_1^2 + x_2^2 - \ell^2 = 0, \; x_1x_3 + x_2x_4 = 0, \; gx_2 - x_3^2 - x_4^2 + \ell^2 x_5 = 0, \; x_1p_1 + x_2p_2 = 0, \; x_3p_1 + x_4p_2 + x_1p_3 + x_2p_4 = 0, \; gp_2 - 2x_3p_3 - 2x_4p_4 + \ell^2 p_5 = 0\}.$

Hence it follows that $(x,p) \in M_3 = TW_2 \cap M_2$ if and only if $(x,p)$ satisfies the
system (6.7) coupled with the equations

(6.9)
$x_1^2 + x_2^2 - \ell^2 = 0,$
$x_1x_3 + x_2x_4 = 0,$
$gx_2 - x_3^2 - x_4^2 + \ell^2 x_5 = 0,$
$x_1p_1 + x_2p_2 = 0,$
$x_3p_1 + x_4p_2 + x_1p_3 + x_2p_4 = 0,$
$gp_2 - 2x_3p_3 - 2x_4p_4 + \ell^2 p_5 = 0,$

and that $M_3$ is the reduction of $M_2$.

Once again, we use (6.7) to express $p_1, p_2, p_3, p_4$ in all equations (6.9) in terms of $x$.
Then the fourth equation is the same as the second one and can be eliminated. Moreover,
with the help of the first equation the fifth equation turns out to be identical with the
third one and can also be dropped. Finally, by means of the second equation the sixth
equation reduces to $3gx_4 + \ell^2 p_5 = 0$. Hence, altogether, it follows that $M_3 = F_3^{-1}(0)$ with

(6.10)  $F_3 : T\mathbb{R}^5 \to \mathbb{R}^8, \quad F_3(x,p) = \begin{pmatrix} x_1^2 + x_2^2 - \ell^2 \\ x_1x_3 + x_2x_4 \\ gx_2 - x_3^2 - x_4^2 + \ell^2 x_5 \\ p_1 - x_3 \\ p_2 - x_4 \\ p_3 + x_1x_5 \\ p_4 + x_2x_5 + g \\ p_5 + \frac{3g}{\ell^2}x_4 \end{pmatrix}.$

The derivative $DF_3(x,p)$ has rank 8 on $M_3 = F_3^{-1}(0)$ and $D_pF_3(x,p)$ has rank 5
everywhere. Thus by the same reasoning as before $M_3$ turns out to be a pure 2-dimensional
$\pi$-submanifold of $T\mathbb{R}^5$.
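The reduction of the sixth equation of (6.9) is the only step of this stage that is not immediate. As a worked sketch, substituting $p_2 = x_4$, $p_3 = -x_1x_5$ and $p_4 = -x_2x_5 - g$ from (6.7) gives

```latex
\begin{aligned}
gp_2 - 2x_3p_3 - 2x_4p_4 + \ell^2 p_5
  &= gx_4 + 2x_1x_3x_5 + 2x_4(x_2x_5 + g) + \ell^2 p_5 \\
  &= 3gx_4 + 2x_5\,(x_1x_3 + x_2x_4) + \ell^2 p_5 \\
  &= 3gx_4 + \ell^2 p_5
     \quad\text{when } x_1x_3 + x_2x_4 = 0 .
\end{aligned}
```

Dividing by $\ell^2$ yields the last component of $F_3$. This is the first equation that determines $p_5$, which is why $D_pF_3$ has rank 5 while $D_pF_2$ has rank 4.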


Finally we note that the projection of $M_3$ is

$W_3 = \pi(M_3) = \{x \in \mathbb{R}^5 : x_1^2 + x_2^2 - \ell^2 = 0, \; x_1x_3 + x_2x_4 = 0, \; gx_2 - x_3^2 - x_4^2 + \ell^2 x_5 = 0\}$

and hence that $W_3 = W_2$. As a result we have

$M_3 \cap TW_3 = M_3 \cap TW_2 = (M_2 \cap TW_2) \cap TW_2 = M_2 \cap TW_2 = M_3;$

that is, $M_3$ equals its reduction $M_4$ and, from this point on, the process becomes stationary.
Altogether we see here that $C(M) = M_3$.

Consequently each point of $C(M) = M_3$ has index 3. By Theorem 6.1,
the DAE reduces locally to a first order ODE on $\pi(C(M)) = \pi(M_3) = W_3 = W_2$. In this
simple case, this ODE can be found explicitly. In fact, for instance, in the neighborhood
of the equilibrium position $x_1 = 0$, $x_2 = -\ell$, we can solve $F_3(x,p) = 0$ by expressing
$p_1, \ldots, p_5$ and $x_2, x_4, x_5$ in terms of $x_1$ and $x_3$. This shows that $(x,p) \in C(M) = M_3$ if
and only if


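(6.11)
$x_2 = -(\ell^2 - x_1^2)^{1/2},$
$x_4 = x_1x_3(\ell^2 - x_1^2)^{-1/2},$
$x_5 = (g/\ell^2)(\ell^2 - x_1^2)^{1/2} + x_3^2(\ell^2 - x_1^2)^{-1},$

and

(6.12)
$p_1 = x_3,$
$p_2 = x_1x_3(\ell^2 - x_1^2)^{-1/2},$
$p_3 = -(g/\ell^2)x_1(\ell^2 - x_1^2)^{1/2} - x_1x_3^2(\ell^2 - x_1^2)^{-1},$
$p_4 = -(g/\ell^2)x_1^2 + x_3^2(\ell^2 - x_1^2)^{-1/2},$
$p_5 = -(3g/\ell^2)x_1x_3(\ell^2 - x_1^2)^{-1/2}.$

Here $p_3 = -x_1x_5$ and $p_4 = -x_2x_5 - g$ have been evaluated with $x_2$ and $x_5$ from (6.11).
Now note that for $x \in W_2$ the right-hand side of (6.12) automatically belongs to $T_xW_2$.
Hence by replacing $p$ by $\dot{x}$ we obtain explicitly the following ODE on $W_2$:

(6.13)
$\dot{x}_1 = x_3,$
$\dot{x}_2 = x_1x_3(\ell^2 - x_1^2)^{-1/2},$
$\dot{x}_3 = -(g/\ell^2)x_1(\ell^2 - x_1^2)^{1/2} - x_1x_3^2(\ell^2 - x_1^2)^{-1},$
$\dot{x}_4 = -(g/\ell^2)x_1^2 + x_3^2(\ell^2 - x_1^2)^{-1/2},$
$\dot{x}_5 = -(3g/\ell^2)x_1x_3(\ell^2 - x_1^2)^{-1/2}.$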
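The tangency claim can be spot-checked numerically. The sketch below rebuilds a point of $M_3$ from a pair $(x_1, x_3)$ using the three constraints defining $W_2$, the system (6.7) and the relation $3gx_4 + \ell^2 p_5 = 0$, and then evaluates the three linear conditions that define $T_xW_2$. The function names, the numerical values of $g$ and $\ell$, and the sample point are illustrative assumptions.

```python
import math

def point_on_core(x1, x3, g=9.81, ell=1.0):
    """Build (x, p) in M_3 from (x1, x3), on the branch near the equilibrium x2 = -ell."""
    s = math.sqrt(ell**2 - x1**2)             # requires |x1| < ell
    x2 = -s                                   # from x1^2 + x2^2 = ell^2
    x4 = x1 * x3 / s                          # from x1 x3 + x2 x4 = 0
    x5 = (x3**2 + x4**2 - g * x2) / ell**2    # from g x2 - x3^2 - x4^2 + ell^2 x5 = 0
    # p1..p4 from (6.7), p5 from 3 g x4 + ell^2 p5 = 0.
    p = [x3, x4, -x1 * x5, -x2 * x5 - g, -3.0 * g * x4 / ell**2]
    return [x1, x2, x3, x4, x5], p

def tangency_residuals(x, p, g=9.81, ell=1.0):
    """Left-hand sides of the three linear conditions defining T_x W_2."""
    x1, x2, x3, x4, x5 = x
    p1, p2, p3, p4, p5 = p
    return [
        x1 * p1 + x2 * p2,
        x3 * p1 + x4 * p2 + x1 * p3 + x2 * p4,
        g * p2 - 2.0 * x3 * p3 - 2.0 * x4 * p4 + ell**2 * p5,
    ]

x, p = point_on_core(0.3, -0.8)               # arbitrary sample point
res = tangency_residuals(x, p)
print(max(abs(r) for r in res))
```

The residuals vanish up to rounding, and at the bottom position $x_1 = 0$ the same construction gives $p_4 = x_3^2/\ell$, the familiar centripetal acceleration.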

Interestingly, (6.13) reduces to a system consisting only of the first and third equations.
In fact, for given initial data $x_1^0$ and $x_3^0$, $x_1$ and $x_3$ are uniquely determined by these two
equations and then $x_2, x_4, x_5$ are explicitly derived from (6.11). In turn, this $2 \times 2$ system
is equivalent to the single scalar second order equation

$$
\ddot{x}_1 = -(g/\ell^2)\, x_1 (\ell^2 - x_1^2)^{1/2} - x_1 \dot{x}_1^2 (\ell^2 - x_1^2)^{-1},
$$

which, with $x_1 = \ell \sin\theta$, becomes

$$
\ddot{\theta} = -(g/\ell) \sin\theta;
$$

that is, the classical 'pendulum equation'.
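The substitution $x_1 = \ell\sin\theta$ can be checked pointwise. The sketch below is illustrative only: the constants and function names are assumptions, and the check is restricted to $|\theta| < \pi/2$, where $(\ell^2 - x_1^2)^{1/2} = \ell\cos\theta$. It uses $\ddot{x}_1 = \ell\cos\theta\,\ddot{\theta} - \ell\sin\theta\,\dot{\theta}^2$ to solve for $\ddot{\theta}$.

```python
import math

G_ACC = 9.81  # gravitational constant g (illustrative value, an assumption)
ELL = 2.0     # pendulum length l (illustrative value, an assumption)

def x1_ddot(x1, x1_dot):
    """Right-hand side of the scalar second-order equation for x1."""
    s = ELL**2 - x1**2
    return -(G_ACC / ELL**2) * x1 * math.sqrt(s) - x1 * x1_dot**2 / s

def theta_ddot_from_x1(theta, theta_dot):
    """Recover theta'' from x1'' with x1 = l*sin(theta), |theta| < pi/2."""
    x1 = ELL * math.sin(theta)
    x1_dot = ELL * math.cos(theta) * theta_dot
    # x1'' = l*cos(theta)*theta'' - l*sin(theta)*theta'^2, solved for theta''
    numerator = x1_ddot(x1, x1_dot) + ELL * math.sin(theta) * theta_dot**2
    return numerator / (ELL * math.cos(theta))

def pendulum_rhs(theta):
    """Right-hand side of the classical pendulum equation."""
    return -(G_ACC / ELL) * math.sin(theta)
```

For any admissible pair $(\theta, \dot{\theta})$ the two functions `theta_ddot_from_x1` and `pendulum_rhs` should agree up to rounding.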
7.  Analytic  Characterizations.

For  the  pendulum  example  all  considerations  were  of  a  global  nature  and  all  equations 
characterizing  the  manifolds  of  the  reduction  chain  could  be  manipulated  explicitly.  But 
this  is  hardly  a  typical  situation.  For  most  practical  applications  it  will  be  important  to 
obtain  analytic  criteria  corresponding  to  the  various  geometric  conditions  contained  in  the 
Definitions  3.1,  4.1,  and  4.2. 

The first condition of Definition 3.1 requires that locally near the point $(x_0,p_0)$ under
consideration the manifold $M$ is embedded in $TY$ where $Y$ is a submanifold of $X$ with the
same dimension $m$ as $M$. Clearly, this condition can hardly be translated into analytic
terms and must be assumed. This is not a major inconvenience because the reduction
procedure of Sections 4 and 5 automatically preserves this property and provides explicit
information about the choice of $Y$. Moreover, as we saw in the previous section, at the
beginning of the process $Y$ is typically given by $Y = X = \mathbb{R}^n$.

All the geometric conditions are local in nature at a point $(x_0,p_0) \in TY$ and are unaltered
by natural tangent bundle isomorphisms; that is, those occurring as tangent maps of local
diffeomorphisms defined in the neighborhood of $x_0$ in $X$. Hence, there will be no loss of
generality in assuming that the problem has been represented in the following form:

Local Assumption. The space $Y$ is given by $Y = \mathbb{R}^m$ (or some open subset of $\mathbb{R}^m$) and
there exists an open neighborhood $U$ in $TY$ of $(x_0,p_0) \in TY$ such that $M = U \cap G^{-1}(0)$
where $G : TY \to Y$ is a (smooth) submersion on $U$; that is, $DG$ has full rank $m$ on $U$.

This assumption simply means that, locally at the given point $(x_0,p_0)$, we have replaced
the manifold $M$ by its connected component (of dimension $m$) containing the point and
that we used a natural tangent bundle chart for $TY$ at $(x_0,p_0)$ (in the terminology of [1]).
Note that with $Y = \mathbb{R}^m$ we have $TY = T\mathbb{R}^m = \mathbb{R}^m \times \mathbb{R}^m$ and that the canonical projection
$\pi : TY \to Y$ is the projection onto the first factor.

From  Theorem  2.2  and  Remark  3.1  we  obtain  immediately  the  following  result: 

Proposition 7.1. Under the Local Assumption, condition (ii) of Definition 3.1 holds if
and only if $D_pG(x,p)$ has constant rank $\rho \le m$ for all $(x,p) \in M$ in some neighborhood
of $(x_0,p_0)$, and in that case we have $\mathrm{ord}_M(x_0,p_0) = \rho$.

Before turning to a characterization of reducibility in Definition 4.2, we need analytic
representations of local projections in the setting of our Local Assumption. For this suppose
that $M$ is a $\pi$-submanifold and hence, in accordance with Proposition 7.1, that $D_pG(x,p)$
has constant rank $\rho$ locally near $(x_0,p_0)$.

By Definition 3.2 a local projection $W$ of $M$ near $(x_0,p_0)$ is the projection $\pi(V)$ of a
sufficiently small open neighborhood $V$ of $(x_0,p_0)$ in $M$ such that $W$ is a submanifold of
$Y = \mathbb{R}^m$ of the same dimension $\rho$ as the order of $(x_0,p_0)$ in $M$. Thus, locally near $x_0$,
the manifold $W$ can be represented in the form $W = g^{-1}(0)$ where $g : \mathbb{R}^m \to \mathbb{R}^{m-\rho}$ is
a submersion on some neighborhood of $x_0$; that is, $Dg$ has full rank $m - \rho$. The tangent
bundle $TW \subset T\mathbb{R}^m$ is then characterized by

$$
TW = \{(x,u) \in T\mathbb{R}^m : g(x) = 0,\ Dg(x)u = 0\}.
\tag{7.1}
$$

Hence any point $(x,p) \in TW \cap V$ satisfies $g(x) = 0$, $Dg(x)p = 0$, and $G(x,p) = 0$. But
the latter equation states that $(x,p) \in V$ and hence implies that $x \in W$, which is $g(x) = 0$.
Therefore we have

$$
TW \cap V = \{(x,p) \in T\mathbb{R}^m : Dg(x)p = 0,\ G(x,p) = 0\}.
\tag{7.2}
$$


Now the question arises how $g$ can be obtained from $G$. Clearly $\dim \ker D_pG(x_0,p_0) =
m - \rho$ and hence there exists a linear map

$$
A \in \mathcal{L}(\mathbb{R}^m, \mathbb{R}^{m-\rho}), \qquad \ker A \cap \ker D_pG(x_0,p_0) = \{0\}
\tag{7.3}
$$

with full rank $m - \rho$. Using $A$ we construct the mapping

$$
\hat{G} : T\mathbb{R}^m \to \mathbb{R}^m \times \mathbb{R}^{m-\rho}, \qquad \hat{G}(x,p) = (G(x,p),\ A(p - p_0)),
\tag{7.4}
$$

for which evidently $\hat{G}(x_0,p_0) = 0$ and

$$
D\hat{G}(x_0,p_0) = \Lambda_0 =
\begin{pmatrix}
D_xG(x_0,p_0) & D_pG(x_0,p_0) \\
0 & A
\end{pmatrix}
\in \mathcal{L}(T\mathbb{R}^m, \mathbb{R}^m \times \mathbb{R}^{m-\rho})
\tag{7.5}
$$

has full rank $2m - \rho$. Hence, $\hat{G}^{-1}(0)$ (restricted to some neighborhood of $(x_0,p_0)$) is a
$\rho$-dimensional submanifold $N$ of $M$ which at that point has the tangent space $T_{(x_0,p_0)}N =
\ker \Lambda_0$. As in the proof of Theorem 2.2 note that the differential of $\pi|_M$ at $(x_0,p_0)$ is simply
the restriction of $\pi$ to $T_{(x_0,p_0)}M$. Hence we have

$$
T_{(x_0,p_0)}N \cap \ker \pi|_{T_{(x_0,p_0)}M} = \ker \Lambda_0 \cap [\{0\} \times \ker D_pG(x_0,p_0)].
$$

In other words, any element $(h,k)$ of the space on the left has to satisfy the relations
$\Lambda_0(h,k) = 0$, $h = 0$, and $D_pG(x_0,p_0)k = 0$, which by (7.3) imply that $k = 0$. Thus by
the second part of Theorem 2.1 it follows that $\pi|_N$ is a local diffeomorphism of some open
neighborhood of $(x_0,p_0)$ in $N$ to $W$.

For the construction of a local coordinate system on $N$ let $Q_0 \in \mathcal{L}(\mathbb{R}^m)$ be a projection
onto some complement $Z_0$ of the range of $D_pG(x_0,p_0)$ so that $\dim Z_0 = m - \rho$ and

$$
Z' = \ker Q_0 D_xG(x_0,p_0)
\tag{7.6}
$$

has at least dimension $\rho$. From $Q_0 DG(x_0,p_0)(h,k) = Q_0 D_xG(x_0,p_0)h$ it follows that $(h,k) \in
\ker \Lambda_0$ implies $h \in Z'$. Conversely, for $h \in Z'$ we have $D_xG(x_0,p_0)h \in \mathrm{rge}\, D_pG(x_0,p_0)$
so that $DG(x_0,p_0)(h,k) = 0$ for some $k \in \mathbb{R}^m$ and, since $A$ is an isomorphism of
$\ker D_pG(x_0,p_0)$ onto $\mathbb{R}^{m-\rho}$, there exists a $k' \in \ker D_pG(x_0,p_0)$ for which $Ak' = -Ak$.
But then we have $\Lambda_0(h, k + k') = 0$, which shows that the projection $\pi$ maps $\ker \Lambda_0$ onto
$Z'$ and hence that $\dim Z' \le \rho$. It follows that $\dim Z' = \rho$.

For any complement $Z''$ of $Z'$ in $\mathbb{R}^m$ we now have $\dim Z'' = m - \rho$ and

$$
Q_0 D_xG(x_0,p_0)|_{Z''} \in \mathrm{Isom}(Z'', Z_0).
\tag{7.7}
$$

In accordance with the decomposition $\mathbb{R}^m = Z' \oplus Z''$ we shall henceforth write $x = z' + z''$
for any $x \in \mathbb{R}^m$ and, in particular, set $x_0 = z_0' + z_0''$.

It is easily checked that $\Lambda_0$ is an isomorphism of $Z'' \times \mathbb{R}^m$ to $\mathbb{R}^m \times \mathbb{R}^{m-\rho}$. Hence the
implicit function theorem applies to the equation $\hat{G}(z' + z'', p) = 0$, and we obtain for $N$
the local coordinate representation

$$
(x,p) \in N \iff z'' = \varphi(z'),\ p = \psi(z'),
\tag{7.8}
$$

where $\varphi$ and $\psi$ are smooth functions in some neighborhood of $z_0'$ such that $z_0'' = \varphi(z_0')$,
$p_0 = \psi(z_0')$. Evidently

$$
g : \mathbb{R}^m = Z' \oplus Z'' \to Z'' \cong \mathbb{R}^{m-\rho}, \qquad g(x) = z'' - \varphi(z'),
\tag{7.9}
$$

satisfies $D_{z''}g(x_0) = I_{Z''}$ and therefore is a submersion at $x_0$. From (7.8), and after
shrinking $W$ if necessary, it follows that

$$
x \in W \iff z'' = \varphi(z'),
$$

which implies that $W = g^{-1}(0)$.

For later purposes, we require the derivatives $Dg(x_0)$ and $D^2g(x_0)(p_0, \cdot)$ when $(x_0,p_0) \in
TW \cap M$. Because of (7.9) it suffices to calculate $D\varphi(z_0')$ and $D^2\varphi(z_0')$, which can be done
by implicit differentiation of $G(z' + \varphi(z'), \psi(z')) = 0$. By a straightforward calculation
using (7.6) we obtain

$$
D\varphi(z_0') = 0, \qquad D\psi(z_0') = 0,
\tag{7.10}
$$

and therefore

$$
Dg(x_0)h = h'', \qquad h = h' + h'' \in Z' \oplus Z'' = \mathbb{R}^m.
\tag{7.11}
$$

From this, in turn, an equally simple calculation provides

$$
D^2g(x_0)(h,k) = [Q_0 D_xG(x_0,p_0)|_{Z''}]^{-1} Q_0 D_x^2G(x_0,p_0)(h',k'),
$$
$$
h = h' + h'',\quad k = k' + k'' \in Z' \oplus Z'' = \mathbb{R}^m.
$$

Now for $(x_0,p_0) \in TW \cap M$ it follows from (7.1) that $Dg(x_0)p_0 = 0$, which by (7.11) implies
that $p_0 = p_0'$ and thus with $k = p_0$ above

$$
D^2g(x_0)(p_0,h) = [Q_0 D_xG(x_0,p_0)|_{Z''}]^{-1} Q_0 D_x^2G(x_0,p_0)(p_0,h'),
\qquad h = h' + h'' \in Z' \oplus Z'' = \mathbb{R}^m.
\tag{7.12}
$$


With  these  results  we  obtain  now  the  following  characterization  of  the  conditions  in 
Definition  4.2. 


Proposition 7.2. Suppose that at $(x_0,p_0) \in TW \cap M$ with $W = \pi(V)$, the Local
Assumption holds. For any $(x,p)$ in $V$ let $Q(x,p)$ be the projection of $\mathbb{R}^m$ onto an arbitrary
complement $Z_{(x,p)}$ of the range of $D_pG(x,p)$ and write $Z_0 = Z_{(x_0,p_0)}$ and $Q_0 = Q(x_0,p_0)$.
Then the conditions (4.3) and (4.4) of Definition 4.2 hold if and only if the mapping

$$
\begin{pmatrix}
Q_0 D_x^2G(x_0,p_0)(p_0,\cdot) & Q_0 D_xG(x_0,p_0) \\
D_xG(x_0,p_0) & D_pG(x_0,p_0)
\end{pmatrix}
\in \mathcal{L}(T\mathbb{R}^m, Z_0 \times \mathbb{R}^m)
\tag{7.13}
$$

has full rank $2m - \rho$ and, after shrinking $V$ if necessary, for all $(x,p) \in TW \cap M$ the
relation

$$
\dim[\ker Q(x,p)D_xG(x,p) \cap \ker D_pG(x,p)] =
\dim[\ker Q_0 D_xG(x_0,p_0) \cap \ker D_pG(x_0,p_0)]
\tag{7.14}
$$

holds.$^4$


Proof. Since $g$ is a submersion at $x_0$ it follows that the mapping

$$
(x,u) \in T\mathbb{R}^m \mapsto (g(x), Dg(x)u) \in \mathbb{R}^{m-\rho} \times \mathbb{R}^{m-\rho}
$$

is a submersion at $(x_0,u)$ irrespective of $u$. Hence, because of (7.1) and $(x_0,p_0) \in TW$, we
find that

$$
T_{(x_0,p_0)}TW = \{(h,k) \in T\mathbb{R}^m : Dg(x_0)h = 0,\ D^2g(x_0)(p_0,h) + Dg(x_0)k = 0\},
$$

and therefore $(h,k) \in T_{(x_0,p_0)}TW \cap T_{(x_0,p_0)}M$ exactly if

$$
\begin{aligned}
Dg(x_0)h &= 0,\\
D^2g(x_0)(p_0,h) + Dg(x_0)k &= 0,\\
D_xG(x_0,p_0)h + D_pG(x_0,p_0)k &= 0.
\end{aligned}
$$

By (7.11) the first relation states that $h \in Z' = \ker Q_0 D_xG(x_0,p_0)$, which is also a
consequence of the third relation. In other words, the first relation is redundant and the above
set of conditions reduces to $(h,k) \in \ker L(x_0,p_0)$ where

$$
L(x,p) =
\begin{pmatrix}
D^2g(x)(p,\cdot) & Dg(x) \\
D_xG(x,p) & D_pG(x,p)
\end{pmatrix}.
\tag{7.15}
$$

$^4$ Obviously (7.14) is independent of the specific choice of the projections $Q(x,p)$.

Thus the condition (4.3) is equivalent with $\dim \ker L(x_0,p_0) = \rho$. From (7.11) and
(7.12) together with (7.7) we find that $(h,k) \in \ker L(x_0,p_0)$ exactly if

$$
\begin{aligned}
Q_0 D_x^2G(x_0,p_0)(p_0,h') + Q_0 D_xG(x_0,p_0)k'' &= 0,\\
D_xG(x_0,p_0)h + D_pG(x_0,p_0)k &= 0.
\end{aligned}
$$

By multiplying the second equation with $Q_0$ we see that $Q_0 D_xG(x_0,p_0)h = 0$ and hence
that $h = h' \in Z'$. Thus, in the first equation $h'$ may be replaced by $h$. Since $k''$ may,
of course, be replaced by $k$ (because $Q_0 D_xG(x_0,p_0)k' = 0$ for $k' \in Z'$), it follows that
$\ker L(x_0,p_0)$ is the null-space of the mapping (7.13) and that $\dim \ker L(x_0,p_0) = \rho$ if and
only if the rank of (7.13) (and also of $L(x_0,p_0)$) equals $2m - \rho$ and hence is full.

Suppose now that the condition (4.3) holds and therefore, as we just saw, that $L(x,p)$
has rank $2m - \rho$ for all $(x,p) \in T\mathbb{R}^m$ near $(x_0,p_0)$. In order to show that (7.14) is equivalent
to (4.4), note that $L(x,p)$ is the derivative of the mapping $(x,p) \mapsto (Dg(x)p, G(x,p))$ whose
local zero set near $(x_0,p_0)$ is precisely $TW \cap V$. This means that, locally near $(x_0,p_0)$, $TW \cap
V$ is a $\rho$-dimensional submanifold of $T\mathbb{R}^m$ whose tangent space at $(x,p)$ is $\ker L(x,p)$. The
condition (4.4) now states that the rank of the linear mapping $\pi|_{T_{(x,p)}(TW \cap V)}$ is constant
for all $(x,p) \in TW \cap V$ near $(x_0,p_0)$. Equivalently, this means that the mapping

$$
(h,k) \in \ker L(x,p) \mapsto h
\tag{7.16}
$$

has constant rank for $(x,p) \in TW \cap V$ near $(x_0,p_0)$. Since $\ker L(x,p)$ has dimension $\rho$ it
follows that (4.4) holds exactly if the dimension of the null-space of (7.16) is constant for
all $(x,p) \in TW \cap V$ near $(x_0,p_0)$. This null-space consists of those pairs $(0,k) \in T\mathbb{R}^m$ for
which

$$
Dg(x)k = 0, \qquad D_pG(x,p)k = 0.
$$

But $Dg(x)k = 0$ means that $k \in T_xW = \pi(T_{(x,p)}M)$. Then there exists some $\ell \in \mathbb{R}^m$
such that $D_xG(x,p)k + D_pG(x,p)\ell = 0$ and by multiplying with $Q(x,p)$ we see that
$Q(x,p)D_xG(x,p)k = 0$. Thus, $\ker Dg(x) = \ker Q(x,p)D_xG(x,p)$ and, since the null-space
of (7.16) is $\{0\} \times [\ker Dg(x) \cap \ker D_pG(x,p)]$, its dimension will be constant exactly if (7.14)
holds. □

Propositions 7.1 and 7.2 show how reducibility of a $\pi$-submanifold $M$ can be checked
in the neighborhood of a given point $(x_0,p_0)$ after a natural tangent bundle chart has
been chosen to reduce the problem to that characterized by the Local Assumption. When
$(x_0,p_0)$ belongs to the reduction $M'$ of $M$ and has order $\rho$ in $M$, then an analytic
characterization of $M'$ near $(x_0,p_0)$ is given by

$$
\begin{aligned}
Dg(x)p &= 0 \quad (\in \mathbb{R}^{m-\rho}),\\
G(x,p) &= 0 \quad (\in \mathbb{R}^m),
\end{aligned}
\tag{7.17}
$$

where we constructed $g : \mathbb{R}^m \to \mathbb{R}^{m-\rho}$ as a submersion at $x_0$ for which the zero set $g^{-1}(0)$
coincides locally near $x_0$ with a local projection $W$ of $M$ at $(x_0,p_0)$. Then $\dim W = \rho$
and $M'$ is locally near $(x_0,p_0)$ a $\rho$-dimensional submanifold of $TW$ which therefore can be
represented as $\hat{H}^{-1}(0)$ with some submersion $\hat{H} : TW \to \mathbb{R}^\rho$ at $(x_0,p_0)$.
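To make the system (7.17) concrete, consider a toy problem that is not taken from the paper: $m = 2$ and $G(x,p) = (p_1 - x_2,\ x_1^2 + x_2^2 - 1)$, so that $D_pG$ has constant rank $\rho = 1$ and the local projection $W$ is an arc of the unit circle, here described by the hypothetical choice $g(x) = x_1^2 + x_2^2 - 1$. Solving $Dg(x)p = 0$, $G(x,p) = 0$ for $p$ where $x_2 \neq 0$ gives $p = (x_2, -x_1)$. The sketch below, with illustrative function names, integrates this field with a classical Runge-Kutta step and monitors the residual of (7.17).

```python
import math

def residual(x, p):
    """Left-hand sides of (7.17) for the toy problem: (Dg(x)p, G(x, p))."""
    x1, x2 = x
    p1, p2 = p
    return (2.0 * x1 * p1 + 2.0 * x2 * p2,  # Dg(x)p for g = x1^2 + x2^2 - 1
            p1 - x2,                        # first component of G
            x1**2 + x2**2 - 1.0)            # second component of G

def reduced_field(x):
    """The p solving (7.17) at a point x of the circle with x2 != 0."""
    x1, x2 = x
    return (x2, -x1)

def rk4_step(x, h):
    """One classical fourth-order Runge-Kutta step for x' = reduced_field(x)."""
    def shifted(a, b, s):
        return (a[0] + s * b[0], a[1] + s * b[1])
    k1 = reduced_field(x)
    k2 = reduced_field(shifted(x, k1, h / 2.0))
    k3 = reduced_field(shifted(x, k2, h / 2.0))
    k4 = reduced_field(shifted(x, k3, h))
    return (x[0] + h / 6.0 * (k1[0] + 2.0 * k2[0] + 2.0 * k3[0] + k4[0]),
            x[1] + h / 6.0 * (k1[1] + 2.0 * k2[1] + 2.0 * k3[1] + k4[1]))

def integrate(x0, h=1e-3, steps=500):
    """Return the largest residual of (7.17) seen and the final state."""
    x, worst = x0, 0.0
    for _ in range(steps):
        worst = max(worst, max(abs(r) for r in residual(x, reduced_field(x))))
        x = rk4_step(x, h)
    return worst, x
```

Starting from `(0.0, 1.0)`, the computed trajectory should follow $(\sin t, \cos t)$ and keep the residual near rounding level. The reduced field is not defined by (7.17) at points with $x_2 = 0$, so the sketch stays away from them.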

Once $\hat{H}$ is known we can use a natural tangent bundle chart on $TW$ to introduce the
representations $W = \mathbb{R}^\rho$, $TW = T\mathbb{R}^\rho$, which means that once again the Local Assumption
holds but with $M$ replaced by $M_1 = M'$ and $G$ replaced by the representation $G_1$ of $\hat{H}$ in
the bundle chart. Thus Propositions 7.1 and 7.2 can be applied again. A natural tangent
bundle chart for $TW$ is easily constructed once $g$ has been determined from $G$ as discussed
above. This leaves us with the question of finding $\hat{H}$ in terms of the data available from
the Local Assumption; that is, in terms of the mapping $G$ characterizing $M$ near $(x_0,p_0)$.
For this it may be noted that although the system (7.17) characterizes $M_1$ near $(x_0,p_0)$
and $Dg(x)p = 0$ holds whenever $(x,p) \in TW$, we cannot choose $\hat{H} = G|_{TW}$ for then $\hat{H}$ maps
into $\mathbb{R}^m$ and not into $\mathbb{R}^\rho$.

We begin by representing the manifold $M$ locally near $(x_0,p_0)$ as the zero set of another
submersion $\tilde{G}$ from which $\hat{H}$ can then be obtained. Let $B \in \mathcal{L}(\mathbb{R}^{m-\rho}, \mathbb{R}^m)$ be a linear map
of full rank $m - \rho$ chosen such that

$$
\mathrm{rge}\, B \cap \mathrm{rge}\, D_pG(x_0,p_0) = \{0\}.
\tag{7.18}
$$

Such a mapping $B$ is necessarily one-to-one. Moreover, set

$$
V' = \ker D_pG(x_0,p_0), \qquad \dim V' = m - \rho,
\tag{7.19}
$$

and let $V''$ be any complement of $V'$ in $\mathbb{R}^m$. We shall henceforth write $p \in \mathbb{R}^m$ in the
form $p = p' + p''$ in accordance with the decomposition $\mathbb{R}^m = V' \oplus V''$. There should be
no confusion with the corresponding notation used earlier relative to the decomposition
$\mathbb{R}^m = Z' \oplus Z''$ since the two decompositions will not appear at the same time.

By the choice of $V''$ the mapping

$$
(k'', \ell) \in V'' \times \mathbb{R}^{m-\rho} \mapsto D_pG(x_0,p_0)k'' + B\ell \in \mathbb{R}^m
$$

is one-to-one and hence a linear isomorphism. It thus follows from the implicit function
theorem that near $(x_0,p_0,0)$ the zero set of the mapping

$$
(x,p,q) \in T\mathbb{R}^m \times \mathbb{R}^{m-\rho} \mapsto G(x,p) + Bq \in \mathbb{R}^m
$$

is a $(2m - \rho)$-dimensional submanifold $\tilde{M}$ of $T\mathbb{R}^m \times \mathbb{R}^{m-\rho}$ characterized by

$$
(x,p,q) \in \tilde{M} \iff \{p'' = \lambda(x,p'),\ q = \mu(x,p')\},
\tag{7.20}
$$

where $\lambda$ and $\mu$ are smooth functions near $(x_0,p_0')$ such that $\lambda(x_0,p_0') = p_0''$, $\mu(x_0,p_0') = 0$.
For $(x,p) \in T\mathbb{R}^m$ near $(x_0,p_0)$ let

$$
H(x,p) = p'' - \lambda(x,p') \in V'' \cong \mathbb{R}^\rho,
\tag{7.21}
$$

and

$$
\tilde{G}(x,p) = (g(x), H(x,p)) \in \mathbb{R}^{m-\rho} \times V'' \cong \mathbb{R}^m.
\tag{7.22}
$$

Since $D_{p''}H(x_0,p_0) = I_{V''}$ we see that $D_pH(x_0,p_0)$ is surjective and hence, because $g$ is
a submersion at $x_0$, that $\tilde{G}$ is a submersion at $(x_0,p_0)$. Now, observe that for $(x,p) \in M$
near $(x_0,p_0)$ we have $G(x,p) = 0$ whence $(x,p,0) \in \tilde{M}$. In particular, (7.20) implies
$p'' = \lambda(x,p')$; that is, $H(x,p) = 0$. On the other hand, from $G(x,p) = 0$ we also obtain
$g(x) = 0$ and therefore $\tilde{G}(x,p) = 0$. Thus, locally near $(x_0,p_0)$, we have

$$
(x,p) \in M \implies (x,p) \in \tilde{G}^{-1}(0).
$$

But since $\tilde{G}$ is a submersion at $(x_0,p_0)$ its zero set near $(x_0,p_0)$ is a submanifold of
$T\mathbb{R}^m$ with the same dimension $m$ as $M$ so that $M$ and $\tilde{G}^{-1}(0)$ coincide in the vicinity of
$(x_0,p_0)$. This means that, in all previous considerations we may use $\tilde{G}$ instead of $G$ when
desirable. More specifically, the conditions expressed in Propositions 7.1 and 7.2 in terms
of $G$ are equivalent to the corresponding conditions expressed in terms of $\tilde{G}$ (because with
both choices these conditions translate geometric, hence intrinsic, properties). Therefore
if $(x_0,p_0) \in TW \cap M$ and the mapping (7.13) has full rank $2m - \rho$ (as it must if $M$ is
reducible), then the mapping obtained by replacing $G$ by $\tilde{G}$ in (7.13) has full rank $2m - \rho$
as well. As was seen in the proof of Proposition 7.2, the full rank condition for the mapping
(7.13) is equivalent to

$$
\mathrm{rank}
\begin{pmatrix}
D^2g(x_0)(p_0,\cdot) & Dg(x_0) \\
D_xG(x_0,p_0) & D_pG(x_0,p_0)
\end{pmatrix}
= 2m - \rho,
$$

and hence to

$$
\mathrm{rank}
\begin{pmatrix}
D^2g(x_0)(p_0,\cdot) & Dg(x_0) \\
D_x\tilde{G}(x_0,p_0) & D_p\tilde{G}(x_0,p_0)
\end{pmatrix}
= 2m - \rho.
\tag{7.23}
$$

Hence for $v'' \in V''$ and with (7.22) and (7.23) we see that there exists some $(h,k) \in
\mathbb{R}^m \times \mathbb{R}^m$ such that

$$
\begin{aligned}
D^2g(x_0)(p_0,h) + Dg(x_0)k &= 0,\\
Dg(x_0)h &= 0,\\
D_xH(x_0,p_0)h + D_pH(x_0,p_0)k &= v''.
\end{aligned}
$$

Here the first two conditions mean that $(h,k) \in T_{(x_0,p_0)}(TW)$, and, since $v''$ is arbitrary, it
follows that $DH(x_0,p_0)|_{T_{(x_0,p_0)}TW}$ maps onto $V''$; that is, that $H|_{TW}$ is a local submersion
at $(x_0,p_0)$ in $TW \cap M$. This shows that $\hat{H} = H|_{TW}$ can be chosen to characterize $TW \cap M$
and hence $M'$ as $\hat{H}^{-1}(0)$ in a neighborhood of $(x_0,p_0)$ and, as desired, by (7.21) $\hat{H}$ maps
into $\mathbb{R}^\rho$.
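As an illustration of this construction, the following worked example traces the steps for a toy mapping that is not taken from the paper: $m = 2$, $G(x,p) = (p_1 - x_2,\ x_1^2 + x_2^2 - 1)$, at the point $x_0 = (0,1)$, $p_0 = (1,0)$. The particular choices of $B$, $V''$, $Q_0$ and $Z''$ are assumptions made for convenience; other admissible choices lead to equivalent representations.

```latex
% Toy example (an assumption, not from the paper): m = 2, rho = 1.
\[
D_pG = \begin{pmatrix} 1 & 0 \\ 0 & 0 \end{pmatrix}, \qquad
\operatorname{rge} D_pG = \operatorname{span}\{e_1\}, \qquad
V' = \ker D_pG = \operatorname{span}\{e_2\}, \qquad
V'' = \operatorname{span}\{e_1\}.
\]
% Choice of B with rge B \cap rge D_pG = {0}:  Bq = (0, q).
\[
G(x,p) + Bq = 0 \iff p_1 = x_2, \quad q = 1 - x_1^2 - x_2^2,
\]
% so that, in the notation of (7.20) and (7.21),
\[
\lambda(x,p') = x_2, \qquad \mu(x,p') = 1 - x_1^2 - x_2^2, \qquad
H(x,p) = p_1 - x_2 .
\]
% Local projection: with Q_0 the orthogonal projection onto span{e_2},
\[
Q_0 D_xG(x_0,p_0) = \begin{pmatrix} 0 & 0 \\ 0 & 2 \end{pmatrix}, \qquad
Z' = \operatorname{span}\{e_1\}, \quad Z'' = \operatorname{span}\{e_2\}, \qquad
g(x) = x_2 - (1 - x_1^2)^{1/2} .
\]
% Chart (x_1, p_1) on TW:  x_2 = (1 - x_1^2)^{1/2},  p_2 = -x_1 p_1 (1 - x_1^2)^{-1/2}.
\[
G_1(x_1,p_1) = H|_{TW} = p_1 - (1 - x_1^2)^{1/2} \in \mathbb{R}^1 = \mathbb{R}^{\rho},
\qquad D_{p_1}G_1 = 1 .
\]
```

Here $G_1$ has full rank with respect to $p_1$, so the reduced problem is the explicit scalar ODE $\dot{x}_1 = (1 - x_1^2)^{1/2}$, whose solution through $x_1(0) = 0$ is $x_1(t) = \sin t$ for small $|t|$.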


8.  Relationship  with  the  Earlier  Theory. 

We use again the setting of the previous section and discuss now the special case when
the point $(x_0,p_0)$ has local index 1 and relate the results to our earlier theory developed
in [11]. This will show that in this case the two approaches essentially coincide and that
some apparent discrepancies can be fully explained.

As defined in Section 5, the point $(x_0,p_0) \in M$ has local index 1 if there exists an open
neighborhood $U$ of $(x_0,p_0)$ in $M$ such that (i) $U$ is a completely reducible $\pi$-submanifold,
(ii) $(x_0,p_0)$ belongs to the core $C(U)$ of $U$, and (iii) the connected component of $C(U)$
containing $(x_0,p_0)$ is a connected component of the reduction $U_1 = U'$ of $U$ but not of $U$
itself (for otherwise the index would be 0). In particular, this means that $U_1$ is reducible
and the reduction $U_1'$ of $U_1$ equals $U_1$.

Under the conditions of Propositions 7.1 and 7.2 and, in particular, if $(x_0,p_0) \in TW \cap
M$, then there exists an open neighborhood $U$ of $(x_0,p_0)$ in $M$ which is a reducible
$\pi$-submanifold and $(x_0,p_0) \in U_1$. By shrinking $U$ and therefore also $U_1$ we may assume
that $U_1 \subset TW$ where $W$ is the local projection of $M$ near $(x_0,p_0)$ used before. Hence,
a necessary and sufficient condition for $U_1$ to be reducible with $U_1' = U_1$ is given by
Proposition 4.2 with $M$ replaced by $U_1$ and $Y$ by $W$. More specifically, since $\pi$ is here
linear, Proposition 4.2 requires that $\pi$ be a linear isomorphism from $T_{(x_0,p_0)}U_1$ to $T_{x_0}W$.
Note that it is actually required that the condition holds for every $(x,p) \in U_1$. But because
$U_1$ can be shrunk to arbitrarily small size, it suffices indeed to require the condition only
at $(x_0,p_0)$ since it then holds automatically at all nearby points. In summary, under the
conditions of Propositions 7.1 and 7.2 we have

$$
(x_0,p_0) \in TW \cap M \text{ has local index 1} \iff \ker \pi|_{T_{(x_0,p_0)}(TW \cap M)} = \{0\}.
\tag{8.1}
$$

This  provides  us  with  the  following  characterization: 

Theorem 8.1. Under the Local Assumption the point $(x_0,p_0) \in M$ has local index 1 if
and only if

(i) rank $D_pG(x,p) = \rho$ is constant for all $(x,p)$ near $(x_0,p_0)$ in $M$.

(ii) $D_xG(x_0,p_0)p_0 \in \mathrm{rge}\, D_pG(x_0,p_0)$.

(iii) For any $k \in \mathbb{R}^m$ the implication

$$
\left.
\begin{aligned}
D_xG(x_0,p_0)k &\in \mathrm{rge}\, D_pG(x_0,p_0)\\
D_pG(x_0,p_0)k &= 0
\end{aligned}
\right\} \implies k = 0
\tag{8.2}
$$

holds.


Proof. The necessity of (i) and (ii) is obvious since (i) is required by Proposition 7.1 and
(ii) merely expresses the assumption that $(x_0,p_0) \in TW \cap M$. Now note that (8.2) is an
equivalent formulation of the condition on the right of (8.1) since in the proof of Proposition
7.2 the null-space of $\pi|_{T_{(x_0,p_0)}(TW \cap M)}$ was seen to be equal to

$$
\{0\} \times [\ker Q_0 D_xG(x_0,p_0) \cap \ker D_pG(x_0,p_0)]
$$

where $Q_0$ is a projection onto a complement of $\mathrm{rge}\, D_pG(x_0,p_0)$. Hence the necessity of
(iii) follows from the equivalence (8.1).

For the proof of the sufficiency we have to show that all the conditions of Propositions
7.1 and 7.2 are satisfied. Then (8.2) will ensure that the condition on the right side of
(8.1) holds.

Clearly the conditions of Proposition 7.1 hold and so does the condition $(x_0,p_0) \in
TW \cap M$ of Proposition 7.2, which is our condition (ii). We now show that the mapping
(7.13) has the full rank $2m - \rho$. For this let $Z_0$ and $Q_0$ be as given in Proposition 7.2
and $(u,v) \in Z_0 \times \mathbb{R}^m$. Since $G$ is a submersion at $(x_0,p_0)$ there exists some $(h,k) \in T\mathbb{R}^m$
such that $D_xG(x_0,p_0)h + D_pG(x_0,p_0)k = v$ and $k$ may be replaced by $k + \ell$ with any
$\ell \in \ker D_pG(x_0,p_0)$ without affecting this relation. By (8.2) we have $\ker Q_0 D_xG(x_0,p_0) \cap
\ker D_pG(x_0,p_0) = \{0\}$; that is, $Q_0 D_xG(x_0,p_0)$ is one-to-one on $\ker D_pG(x_0,p_0)$ and hence,
since both spaces have dimension $m - \rho$, a linear isomorphism of $\ker D_pG(x_0,p_0)$ onto $Z_0$.
Thus, $\ell \in \ker D_pG(x_0,p_0)$ can be found so that

$$
Q_0 D_xG(x_0,p_0)\ell = u - Q_0 D_x^2G(x_0,p_0)(p_0,h) - Q_0 D_xG(x_0,p_0)k,
$$

which proves the claim.

Finally, we show that (7.14) holds. As noted above the right-hand side of (7.14) is zero
and it suffices to prove that for $(x,p) \in TW \cap M$ near $(x_0,p_0)$ we have, with the same
notation as in Proposition 7.2,

$$
\ker Q(x,p)D_xG(x,p) \cap \ker D_pG(x,p) = \{0\}.
\tag{8.3}
$$

We observed that (7.14) is independent of the specific choice of the projections $Q(x,p)$.
In particular, we may choose $Q(x,p)$ to depend continuously on $(x,p) \in M$. In fact, let
$\{e_1^0,\ldots,e_\rho^0\}$ be an orthonormal basis of $\mathrm{rge}\, D_pG(x_0,p_0)$, relative, say, to the canonical
inner product of $\mathbb{R}^m$, and choose $w_1,\ldots,w_\rho \in \mathbb{R}^m$ such that $e_i^0 = D_pG(x_0,p_0)w_i$ for
$i = 1,\ldots,\rho$. By continuity, the vectors $e_i(x,p) = D_pG(x,p)w_i$, $i = 1,\ldots,\rho$, remain linearly
independent for $(x,p) \in M$ near $(x_0,p_0)$ and hence by (i) they span a space of dimension
$\rho = \dim \mathrm{rge}\, D_pG(x,p)$. Therefore, we have $\mathrm{rge}\, D_pG(x,p) = \mathrm{span}\{e_1(x,p),\ldots,e_\rho(x,p)\}$
and by applying the Gram-Schmidt process to the vectors $e_i(x,p)$ we obtain an orthonormal
basis of $\mathrm{rge}\, D_pG(x,p)$.$^5$ As a sum of dyadic products of the vectors of this basis, the
orthogonal projection $P(x,p)$ onto $\mathrm{rge}\, D_pG(x,p)$ depends continuously on $(x,p) \in M$ near
$(x_0,p_0)$, and hence the same is true of $Q(x,p) = I - P(x,p)$.

$^5$ Recall that the Gram-Schmidt process involves only continuous operations.
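The construction of $Q = I - P$ from a spanning set of the range can be sketched in a few lines. This is only an illustration under stated assumptions: the helper names are hypothetical, the one-parameter family `range_vectors(s)` merely stands in for the vectors $e_i(x,p) = D_pG(x,p)w_i$, and the loop uses the modified ordering of the Gram-Schmidt process.

```python
def dot(u, v):
    """Canonical inner product of two vectors given as lists."""
    return sum(a * b for a, b in zip(u, v))

def gram_schmidt(vectors):
    """Orthonormalize linearly independent vectors (modified Gram-Schmidt)."""
    basis = []
    for v in vectors:
        w = list(v)
        for e in basis:
            c = dot(w, e)
            w = [wi - c * ei for wi, ei in zip(w, e)]
        norm = dot(w, w) ** 0.5
        basis.append([wi / norm for wi in w])
    return basis

def complementary_projector(vectors):
    """Return Q = I - P, with P the orthogonal projector onto span(vectors)."""
    m = len(vectors[0])
    basis = gram_schmidt(vectors)
    # P is the sum of the dyadic products e e^T of the orthonormal basis vectors.
    return [[(1.0 if i == j else 0.0) - sum(e[i] * e[j] for e in basis)
             for j in range(m)] for i in range(m)]

def apply(Q, v):
    """Matrix-vector product Q v."""
    return [dot(row, v) for row in Q]

def range_vectors(s):
    """Hypothetical spanning vectors of a 2-dimensional range in R^3."""
    return [[1.0, s, 0.0], [0.0, 1.0, s]]
```

For instance, `complementary_projector(range_vectors(0.1))` annihilates both spanning vectors and is idempotent, and its entries change only slightly when the parameter is perturbed, in line with the continuity argument above.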

It only remains to show that (8.3) holds for all $(x,p) \in TW \cap M$ near $(x_0,p_0)$. Suppose
to the contrary that there exist sequences $(x_j,p_j) \in TW \cap M$ and $k_j \in \mathbb{R}^m$ such that
$(x_j,p_j) \to (x_0,p_0)$ and, for all $j > 0$, $k_j$ is a unit vector under some norm for which
$Q(x_j,p_j)D_xG(x_j,p_j)k_j = D_pG(x_j,p_j)k_j = 0$. By extracting a subsequence we may assume
that $k_j \to k$ where $k \in \mathbb{R}^m$ again is a unit vector. By continuity it then follows that
$k \in \ker Q(x_0,p_0)D_xG(x_0,p_0) \cap \ker D_pG(x_0,p_0)$, which is a contradiction to (8.2). □
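As an illustration of how the conditions of Theorem 8.1 read in a familiar situation, consider a semi-explicit system. This example is not part of the paper: the splitting $x = (y,w) \in \mathbb{R}^a \times \mathbb{R}^b$, $p = (p_y,p_w)$, and the smooth mappings $f$ and $c$ are assumptions introduced only for this sketch.

```latex
% Semi-explicit example (an assumption): m = a + b,
%   G(x,p) = ( p_y - f(y,w),  c(y,w) ).
\[
D_pG = \begin{pmatrix} I_a & 0 \\ 0 & 0 \end{pmatrix}, \qquad
D_xG = \begin{pmatrix} -D_yf & -D_wf \\ D_yc & D_wc \end{pmatrix}, \qquad
\rho = a, \qquad
\operatorname{rge} D_pG = \mathbb{R}^a \times \{0\}.
\]
% Condition (i) holds trivially since D_pG is constant.
% Condition (ii): the second block of D_xG(x_0,p_0) p_0 must vanish,
\[
D_yc(x_0)\, p_{0,y} + D_wc(x_0)\, p_{0,w} = 0 .
\]
% Condition (iii): D_pG k = 0 forces k = (0, k_w), and then
\[
D_xG(x_0,p_0)k = \bigl(-D_wf(x_0)\,k_w,\; D_wc(x_0)\,k_w\bigr)
\in \mathbb{R}^a \times \{0\}
\iff D_wc(x_0)\,k_w = 0 ,
\]
% so that (8.2) holds exactly if D_wc(x_0) is nonsingular.
```

Under these assumptions condition (iii) reduces to the nonsingularity of $D_wc(x_0)$, which is the usual index-1 requirement for semi-explicit systems, while condition (ii) is the consistency requirement obtained by differentiating the constraint $c(y,w) = 0$.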

Theorem 8.1 can be compared with a related result in [11] where a different approach
is taken and a different definition for the index is given. It turns out that the result in
[11] corresponding to Theorem 8.1, when phrased in our present notation, requires the
conditions:

Condition (i)'. rank $D_pG(x,p) = \rho$ is constant for all $(x,p)$ near $(x_0,p_0)$ in $T\mathbb{R}^m$.

Condition (ii)'. $D_xG(x_0,p_0)p_0 \in \mathrm{rge}\, D_pG(x_0,p_0)$.

Condition (iii)'. If for any vectors $k \in \mathbb{R}^m$ and $q_0 \in \mathbb{R}^m$ such that $D_xG(x_0,p_0)p_0 +
D_pG(x_0,p_0)q_0 = 0$ the relations

$$
\begin{aligned}
D^2_{x,p}G(x_0,p_0)(p_0,k) + D^2_pG(x_0,p_0)(q_0,k) + D_xG(x_0,p_0)k &\in \mathrm{rge}\, D_pG(x_0,p_0),\\
D_pG(x_0,p_0)k &= 0
\end{aligned}
$$

hold, then $k = 0$.

In condition (iii)' it may be noted here that the requirement for the vector $q_0$ is consistent
with (ii)' and that $D^2_{x,p}G(x_0,p_0)(p_0,k)$ stands for $D^2G(x_0,p_0)((p_0,0),(0,k))$.

It immediately appears that condition (i) of Theorem 8.1 is weaker than (i)' since
constancy of the rank of $D_pG(x,p)$ is required only in a neighborhood of $(x_0,p_0)$ in $M$. The
approach taken in [11] does not go through under this weaker assumption. Conditions (ii)
and (ii)' are the same for both results but an obvious discrepancy exists among the
conditions (iii) and (iii)' since the second derivative of $G$ plays no role in Theorem 8.1. However,
the discrepancy is removable. In fact, suppose that for $(x_0,p_0) \in M$ the conditions (i)',
(ii)', and (iii)' hold. Then for any $k \in \ker D_pG(x_0,p_0)$ we have

$$
D^2_{x,p}G(x_0,p_0)(p_0,k) + D^2_pG(x_0,p_0)(q_0,k) \in \mathrm{rge}\, D_pG(x_0,p_0),
$$

and hence the condition (iii)' coincides with condition (iii) of Theorem 8.1.

In order to see this observe that

$$
\begin{aligned}
&D^2_{x,p}G(x_0,p_0)(p_0,k) + D^2_pG(x_0,p_0)(q_0,k)\\
&\quad= D^2G(x_0,p_0)((p_0,0),(0,k)) + D^2G(x_0,p_0)((0,q_0),(0,k))\\
&\quad= D^2G(x_0,p_0)((p_0,q_0),(0,k)) = D[D_pG(x,p)k](p_0,q_0)\big|_{(x,p)=(x_0,p_0)}\\
&\quad= \frac{d}{dt}\bigl[D_pG(x_0 + tp_0, p_0 + tq_0)k\bigr]\Big|_{t=0}.
\end{aligned}
\tag{8.4}
$$

By a method identical to the one used in the proof of Theorem 8.1 it was shown in [11]
that the condition (i)' implies that for all $(x,p) \in T\mathbb{R}^m$ sufficiently close to $(x_0,p_0)$ a
projection $Q(x,p)$ onto a complement of $\mathrm{rge}\, D_pG(x,p)$ can be chosen which depends not
only continuously but smoothly upon $(x,p)$. Thus, with

$$
q(t) = Q(x_0 + tp_0, p_0 + tq_0), \qquad \varphi(t) = D_pG(x_0 + tp_0, p_0 + tq_0)k,
\tag{8.5}
$$

it follows that $q(t)\varphi(t) = 0$ for all sufficiently small $|t|$ and by differentiating this identity
at $t = 0$ we obtain

$$
\frac{dq}{dt}(0)\,\varphi(0) + q(0)\,\frac{d\varphi}{dt}(0) = 0.
$$

Since for $k \in \ker D_pG(x_0,p_0)$, $\varphi(0) = 0$, this relation reduces to $q(0)(d\varphi/dt)(0) = 0$ and
because $q(0) = Q(x_0,p_0)$ projects onto a complement of $\mathrm{rge}\, D_pG(x_0,p_0)$ this reads

$$
\frac{d\varphi}{dt}(0) \in \mathrm{rge}\, D_pG(x_0,p_0),
$$

and the conclusion follows from (8.4) and (8.5).
For problems with arbitrary index, there is also some similarity between the methods
of this paper and those of [11]. Indeed, for $(x_0,p_0)$ in the core $C(M)$ of $M$ it follows
from the analytic results of Section 7 that the structure of $C(M)$ near $(x_0,p_0)$ can be
determined from a sequence of mappings $G_0,\ldots,G_\nu$ where $\nu$ is the index of $(x_0,p_0)$ and
$G_\nu^{-1}(0)$ represents $C(M)$ near $(x_0,p_0)$ in a local chart. This is reminiscent of what was
done in [11] where also a sequence of mappings $F_0,\ldots,F_{\nu'}$ is defined inductively. But an
important difference is that the $G_j$ are defined over, and map into, spaces with smaller
and smaller dimension while no reduction of dimension occurs with the $F_j$ in [11].